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® Polygalacturonase encoding fungal expression system. 

The invention concerns a recombinant DNA molecule coding for a polygalacturonase (PG) expression 
cassette comprising the structural gene of a PG and/or corresponding regulatory sequences, or hybrid vectors 
comprising a DNA derived from a PG gene, further transformed hosts, especially filamentous fungi, methods for 
the preparation of said recombinant DNA molecules and transformed hosts and the preparation of polypeptides 
by means of said DNAs and said hosts, purified single PGs either from natural or transformed hosts, the 
preparation of mixtures of PGs optionaiiy with other enzymes, and the use of PGs or mixtures thereof in 
industry. 
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FUNGAL EXPRESSION SYSTEM 

Field of the invention 

The invention relates to the field of genetic engineenng and prpvides novel DNA molecules comprising 
DNA sequences coding for proteins with polygalacturonase activity and/or the promoter, signal and/or 
5 transcriptional terminator sequences naturally linked thereto. The novel DNA molecules are useful for the 
construction of expression cassettes for the production of polygalacturonases or the expression of foreign 
genes in filamentous fungi. 

10 Background of the invention 



Although in genetic engineering techniques numerous polypeptide expression systems for prokaryotic 
and eukaryotic hosts are already known, there is a continuing need for novel systems which have 
advantages over previous systems. 

75 Very widely used hosts are the prokaryot Escherichia coli and the eukaryotic yeasts, e.g. Sac- 
charomyces cerevisiae , for which a large number of different expression hybrid vectors, mostly plasmids, 
have been developed. The drawbacks of E, coii hosts are that they cannot glycosylate polypeptides and 
that intracellular expression of foreign peptides may lead to accumulation within the host cell and prevention 
of further growth. The yeasts do glycosylate, however, like E. coli , they do not secrete the polypeptides, 

20 except small ones, into the nutrient medium, but into the periplasmic space. Higher eukaryotic hosts such 
as mammalian cancer cells are able to glycosylate and secrete into the nutrient medium, however, 
cultivation thereof is slow and expensive and the danger exists that oncogenic nucleic acids may be 
isolated together with the desired peptide. 

In the search for other hosts filamentous fungi, such as Neurospora crassa , Aspergillus nidulans and 

25 Aspergillus niger . have also been investigated. The application thereof in genetic engineering techniques 
has been slower, mainly for lack of an appropriate transformation system. In contrast to Saccharomyces 
cerevisiae , filamentous fungi do not contain plasmids which could be used for the introduction of foreign 
genes and phenotypic selection, it is, hov/ever, possible to transform filamentous fungi with foreign plasmids 
containing a selectable marker gene. Most vectors described so far for filamentous fungi do not autono- 

30 mously replicate, as those of yeasts do, but are integrated into the funga! chromosome. This event occurs 
generally at a lower frequency. However, integrative transformation renders the transformants mitotically 
very stable, even under non-selective conditions. Stable integration of more than one hundred copies has 
been reported. 

The first vector for filamentous fungi described contained the qa-2 gene of Neurospora crassa as 
35 selectable marker [Case. M.E.. Schweizer. iVl., Kushner. S.R. and Giles, N.H. (1979) Proc. Natl. Acad. Sci. 
USA 76,5259-5263: Case. M.E. (1982) in Genetic Engineering of Microorganisms for Chemicals (Hollander, 
A.. DeMoss. D., Kaplan. S.. Konisky. J., Savage, D. and Wolfe. R.S., eds). pp. 87-100, Plenum]. 

In Aspergillus nidulans , which has a sexual cycle and is therefore amenable to classical genetic 
manipulations, both negative and positive selection systems have been described (Ballance et al. BBRC 
40 ri_2 .284.1983; Tiiburn et al. Gene 26 . 205,1983; Yelton et aL PNAS 81 , 1470, 1984; Yelton and 
Timberlake J. Cell. Biochem. Suppi. 9C , 173. 1985; Johnstone et al. EMBO j7~4 . 1307, 1983; Tiiburn et al. 
Gene 26 , 205, 1983; Wernars et al., Curr. Genet. 9 , 361, 1985; Kelly, J.M, et al~ EMBO J. 4 , 475. 1985). 

Compared to N. crassa or A. nidulans , A. niger is by far the more important organism, as it is used 
widely in the industrial production of enzymes, e.g. for use in the food industry. A. niger is known to secrete 
45 a variety of hydrolytic enzymes, e.g. giucoamylase, a-amylase, pectinase, "cellulase, ^-glucanase, jS- 
galactosidase. naringinase, pentosanase, acid protease and lignase, the giucoamylase and pectinase 
complex being the most important ones. 

A. niger has no known sexual cycle. Mutations can therefore not be introduced via meiotic recom- 
bination. By classical mutation and selection procedures, extensive strain improvements in the secretion of 
50 hydrolytic enzymes have however been achieved. 

Of the genes of A. niger enzymes only those of giucoamylase (Boel et ai. EMBO J. 3 , 1581, 1984) and 
alcohol and aldehyde dehydrogenase (WO 86/06097) together with their promoter and" signal sequences 
have been characterised and used in transformation experiments with A. nidulans and A. niger . 

As selection markers for A. niger the heterologous amds gene (Kelly and Hynes", EMBO J. 4 . 475, 
1985). and the argB gene (Buxton et ai., Gene 37 .207,1985; EP 184 438; WO 86/06097). both obtained 
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from A. ntdulans , have been used. 

A. niger is the most important organism for the industrial production of pectin degrading enzymes, e.g. 
polygalacturonases, pectin lyases or pectin esterases. Pectins are pofygalacturonides of high molecular 
v/eight (20000-40000 D) consisting of a-1 ,4-glycosidic bounded D-gaiacturonic acid polymers. Some of the 

5 uronic acid groups are esterified with methanol which can be split off by pectin esterase. Pectins occur in 
nature as constituents of higher plant cells, where they are attache^d to cellulose molecules mainly found in 
the primary cell wall and the middle lamella. Amongst the richest sources of pectin are lemon and orange 
rind, which contain about 30 % of this polysaccharide. Pectic enzymes degrade the carbohydrate polymer 
substrate either by hydrolysis of the a-1 ,4-glycosidic bond (endo- and exo-potygalacturonases) or by 

70 transelemination (pectin lyases) leading to a satured and to an a-4,5 unsaturated poly- or oligogalacturonide. 
The systematic name of endo-polygalacturonase is (poly-(1 ,4-a-D-gaiacturonide))glycanhydrolase (EC 
3.2.1.15) and of exo-polygalacturonase is (poly-(1,4-a-D-gaIacturonide))-galacturonohydrolase (EC 3.2.1.67). 
Another relevant pectin degrading galacturonase enzyme is rhamnogalacturonase which hydrolyses bonds 
between ot-D-galacturonate and L-rhamnose. 

15 While pectin lyases are specific for highly esterified pectins, polygalacturonases hydrolyse low es- 
terified pectins. The combined action of pectin esterases and polygalacturonases can also depolymerize 
highly esterified pectins. 

The applications of polygalacturonase and enzyme mixtures thereof in fruit and vegetable processing 
(Table 1) have developed from the original uses of pectic enzymes for treatment of soft fruit to ensure high 

20 yields of juice and pigments upon pressing and for the clarification of raw press juices. Technical enzyme 
preparations in use for these processes contain pectin esterases, polygalacturonases and pectin lyases in 
varying amounts along with other enzymes such as arabinanases, galactanases, xylanases, ^-1,4- 
glucanases, glycosidases and proteases. 

Fungal pectinase preparations containing predominantly endopolygalacturonase and being free of pectin 

25 esterase are used successfully as macerating enzymes for the production of pulpy nectars which have a 
more smooth consistency and have higher contents in soluble solids, pigments and nutrients than products 
prepared by a mechanical-thermal process. 

Table 1 (from ref.32) 

30 



Use of polygalacturonases and mixtures thereof in fruit and vegetable industry. 


Enzymes 


Use 


Polygalacturonase 

Pectlnesterase + Polygalacturonase 
and/or Pectin lyase 

Pectinesterase/Polygalacturonase/Pectin 

lyase + (Hemi-) 

Celluiases 


Maceration, Citrus juice 
stabilization/viscosity reduction 
Juice clarification, juice/oil 
extraction. Citrus peel oil, citrus pulp 
wash 

Liquefaction, clear/cloudy juices 

Enhance natural product extraction 
Valorization biomass/feed 



45 

The presence of pectin esterase can easily transform the macerating activity of a pure polygalac- 
turonase into cell disintegration activity because of the general depolymerizing activity of the pectin 
esterase/polygalacturonase combination. 

Through the use of pectic and cellulolytic enzymes the cell walls of fruit pulps can be degraded to the 
50 stage of almost complete liquefaction. The presence of both endo- and exo- j9-1.4 glucanases (celluiases) 
as well as pectic enzymes is essential (ref. 31). 

Polygalacturonase and enzyme mixtures thereof are also useful for liquefaction and saccharifaction of 
biomass, for example, for the production of fermentable polysaccharides from plant cells (ref. 33) or for the 
modification of pectins (for reviev/ see ref. 32). Polygalacturonases in combination with xylanases are also 
55 useful in the paper pulping industry for instance (ref. 44). 

In A. niger the proteins of the pectic complex are not expressed constitutively. Under inducing 
conditions using pectin or breakdown products thereof A. niger expresses the above mentioned enzymes, 
when other carbon sources, such as glucose or sucrose, are growth limiting. In surface cultures the pectic 
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enzymes tend to remain associated with the outer cell wall. 

Both Rexova-Benkova and Markovic (ref. 29) and Rombouts and Piinik (ref. 15) disclose a number of 
polygalacturonases obtained from Aspergillus niger and other fungi; as well as from bacteria and plants, 
v/hich were described as purified. However, as no structural data were given there is still a need for single 
5 polygalacturonases with high specific activity. Advantageously they should be pure enough to allow the 
determination of their amino acid sequences. 

Hereinafter, polygalacturonases are also named PG. 



10 Object of the invention 



Objects of the present invention are purified single PGs either from natural or transformed hosts. 
The present invention is based on the purification and partial structure determination of single PGs, e.g. 
PGI, 11. IIIA, IIIB, IV, in particular PGI or PGII, which allows the synthesis of DNA probes coding for relevant 
;5 parts of the protein. 

An object of the present invention is to screen for and isolate by means of the DNA probes, DNA 
sequences coding for PGM or PGI, eventually together with pre- and post-sequences thereof, from a gene 
library of A. niger . A further object is to identify further PG genes by hytiridization of parts of the PGII gene 
(named pgall ) or the PGI gene (named pgal ) with a gene library of a fungal strain, e.g. of A. niger . 

20 Further objects are recombinant DNA molecules coding for polygalacturonase gene~expression cas- 
settes which comprise a structural gene of a polygalacturonase, optionally with intron sequences, and/or 
regulatory sequences of a PG gene, e.g. promoter, signal and/or transcriptional terminator sequences. 
Further recombinant DNA molecules of the invention are, for example, hybrid vectors comprising DNA 
derived from a PG gene of the invention. 

25 Further objects are hosts, especially filamentous fungi, e.g. Aspergillus hosts, transformed with said 
vectors, methods for the preparation of the recombinant DNA molecules and the transformed hosts and the 
use of the recombinant DNA molecules for the preparation of the novel expression cassettes or hybrid 
vectors. 

An object of the present invention is also the overproduction of PGs, for example in Aspergillus species, 
30 and the production of the single PGs which are uncontaminated with other PGs, or of predetermined 
artificial mixtures thereof. 

Another object concerns the production of any heterologous protein the structural gene of which can be 
expressed under the control of the present recombinant PG DNAs. 

The various subjects of the invention will become more evident from the following detailed description 
35 of the invention. 



Detailed description of the invention 



40 

The Recombinant DNA Molecules 



The present invention concerns a recombinant DNA molecule comprising a DNA sequence selected 

from 

45 a) the Aspergillus niger DNA insert of pGW1800 or pGW1900. 

b) a DNA sequence which hybridizes to the coding region for the mature PGII or PGI comprised by a 
DNA insert of a) and which comprises a structural gene for a polypeptide with polygalacturonase activity, 
and optionally a promoter, a coding region for a signal or leader peptide and/or a transcriptional 
terminator thereof, 

50 c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and 
which is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or 

c) . 

More particular, the invention concerns a recombinant DNA molecule comprising a DNA sequence 
55 selected from 

a) the Aspergillus niger DNA insert of pGW1800, 

b) a DNA sequence which hybridizes to the coding region for the mature PGII comprised by the DNA 
insert of a) and which comprises a structural gene for a polypeptide with polygalacturonase activity, and 

5 
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Optionally a promoter, a coding region for a signal or leader peptide and/or a transcriptional terminator 
thereof. 

c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and 
5 which is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or 

c), or a recombinant DNA molecule comprising a DNA sequence, selected from 

a) the Aspergillus niger DNA insert of pGW1900. 

b) a DNA sequence which hybridizes to the coding region for the mature PGI comprised by the DNA 
insert of a) and which comprises a structural gene for a polypeptide with polygalacturonase activity, and 

10 optionally a promoter, a coding region for a signal or leader peptide and/or a transcriptional terminator 
thereof 

c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and 
which is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or 

15 C). 

The plasmids pGW1800 and pGW1900 are comprised in the E. coll strains JM109/pGWl800 and 
DH5aFVpGW1900, respectively, deposited with the Deutsche SammlungTUr Mikroorganismen und Zellkul- 
turen (DSM). 

Polypeptides with polygalacturonase activity are herein also referred to as polygalacturonases (PGs), 

20 This term also includes fragments or mutants of the naturally occurring polygalacturonases which retain 
enzymatic activity. The terms polygalacturonase or PG used herein also include other gaiacturonases 
having as substrate an oligo-or polysaccharide comprising galacturonate, e.g. oligogaiacturonase cleaving 
an oligogalacturonate or rhamnogalacturonase cleaving a copolymer of rhamnose and galac turonate. or 
fragments or mutants thereof retaining enzymatic activity. A "leader peptide" is intended to be the 

25 sequence of a prepro-PG which is cut off during the formation of a mature PG. A "signal peptide" is cut off 
in a first step by the "signal peptidase" in the endoplasmatic reticulum. 

A recombinant DNA molecule of the present invention comprises, for example, the Aspergillus niger 
DNA insert of pG W1 800 or pG W1 900. 

The 4.1 kb long A. niger N400 insert of pGW1800 (see Fig. 2) extends from the Xbal restriction site 

3Q (map position 1) to the'EcoRI restriction site (map position 4 100). This insert comprises the promoter and 
the transcriptional terminator region of the PGIi gene pgall as well as the coding region for prepro-PGIl, 
including introns. The DNA sequence of the A. niger insert fragment comprising pga II starting at the Xbal 
site in the approximate map position 1 and eTTding at the Pvull site in the approximate map position 3050 
shown in Figure 2 was determined and is depicted in the sequence listing under the Sequence Identification 

35 Number (SEQ ID NO.) 2. 

The promoter region of pga II extends upto position 1357 of this DNA sequence with the Sequence 
Identification Number (SEQ ID NO.) 2. The coding region for the leader peptide of prepro-PGIl extends from 
position 1357 up to position 1437. The signal peptide is encoded by the nucleotides 1357 up to 1419. The 
coding region for the mature PGII extends from position 1438 to position 2494. it is followed by the 

40 transcriptional terminator region which starts at position 2499. 

The 8.6 kbp long A. niger N400 insert of pGW1900 (see Fig. 3) extends from the Bam HI site in map 
position 1 up to the Bam HI site in the approximate map position 8600 indicated in Figure 3. The DNA 
sequence extending from about map position 6280 up to about 3880 is given in the sequence listing under 
SEQ ID NO. 1. 

45 The promoter region of the 8.6 kbp insert extends upto position 910 of the DNA sequence with SEQ ID 
No. 1, The coding region for the 31 amino acid long leader peptide of prepro-PGI extends from position 910 
up to position 1002. The signal peptide is encoded by nucleotides 910 to 963. The coding region for mature 
PGI starts at position 1003 and extends up to 2127. The transcriptional terminator region starts at position 
. 2131. 

50 DNA sequences coding for PGI or PGII derived from the A. niger N400 inserts of pGW1900 or 
pGWlSOO hybridize under "homologous" conditions with the co'ding region of the PGI or PGII gene, 
respectively, or of a fragment thereof derived from A. niger N400. DNA sequences which are at least 
partially homologous to the A. niger N400 PGI or PGIlToding region and encode a protein with PG activity 
are members of the PG geni family. DNA sequences which are only partially homologous hybridize with 

55 the A. niger N400 PGI or PGII derived sequences under so called "heterologous" conditions which are less 
stringent than the "homologous" conditions. A member of the PG gene family which only hybridizes under 
"heterologous" conditions can be another PG gene of A. niger N400 than the PGI or PGII gene or a PGI, 
PGIi or a different PG gene of a PG producing fungal "strain other than A. niger N400. Such fungal strain 
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can be derived, for example, from Aspergillus spec .. e.g. A. japonicus . A. oryzae , A. nidulans , or an A. 
niger strain other than N400, Trichoderma spec, , Botrytis lpec SclerotFnia spec "Fusariunn spec , or 
other phytopathogenic fungi as well as from yeast, for example from PG producing Kluyveromyces s^Dec . 
such as |C fragilis . A preferred example for such a strain is A. niger NW756. Thus, the PG gene farnii7"of 
5 the invention comprises the genes encoding PGI, PGII. PGIIIA. PGIIIB. PGIV and other PGs, preferentially 
derived from Aspergillus spec, more preferentially from A. niger . most preferentially from strain N400 or 
NW 756, but also from other PG producing fungal strainsTMembers of the PG gene family are named "PG 
genes". 

The protein products of the members of the PG gene family include enzymes ("galacturonases") which 

70 can cleave ollgo- or polysaccharides comprising galacturonate, e.g. oligogalacturonase, rhamnogalac- 
turonase or in particular polygalacturonase. 

A conventional hybridization procedure is described e.g. by Benton and Davis (ref. 7). Homologous and 
heterologous hybridization conditions are more precisely defined hereinafter. 

The term "derivative" when used in connection with the novel DNA sequences of the Invention is 

75 intended to include larger derivatives containing flanking sequences, fragments of said DNA sequences and 
mutants, especially artificial mutants. 

Larger derivatives of the novel DNA sequences are those excisable from the A. niger genome and 
comprising the DNA sequences which hybridize to the coding region for the mature PGII or PGI comprised 
by the Aspergillus niger DNA insert of pGW1800 or pGW1900, which code for a polypeptide with 

20 polygalacturonase activity, and which may contain the same or different promoter, signal and/or transcrip- 
tional terminator sequences as the Aspergillus niger DNA insert of pGWlSOO or PGW 1900. Such 
derivatives can be found in a genomic library of A. niger N400 or NW756 obtained by fragmentation of the 
nucleic acids, treatment of the fragments with a Tuitable restriction enzyme, e.g. EcoRI, BaMHI or Hindlll. 
ligating into a suitable vector, e.g. the lambda phage or the plasmid pBR322, cloning, e.g. in E. coli, and 

25 excising again with a suitable restriction enzyme. 

A preferred example for a derivative of a DNA sequences hybridizing with the coding region of the A. 
niger N400 PGI or PGII gene under "heterologous" conditions is an insert or fragment thereof of a vector 
selected from the group of vectors consisting of XPG-A9. -A10, -A43, -84, -835, -836, -CI. -C15, -C20. 
-C37. -D8, -D11, -D12, -E6, -E38, -F5, -G17. -G27, -X31 and -Y33. pGW1756 and pGW1910. The latter one 

30 comprises the structural gene pga C encoding A. niger N40O PGC derived from XPG-C20. pGW1756 
(Figure 6) comprises the A. niger NW756 derived "PGII gene which shows a restriction pattern and a DNA 
sequence (see SEQ ID Na 3) distinct from the A. niger N400 PGII gene. The strains NW756 and NW400 
are therefore not closely related and might belong~to different species of filamentous fungi. Accordingly, the 
A. niger N400 PGI or PGII structural gene can specifically hybridize under heterologous conditions also with 

35 PG genes of other species. 

Plasmid pGW1756 is composed of an about 4000 bp long pEMBL18 vector fragment and an about 
5500 bp long A. niger NW756 DNA fragment. The PGlt encoding gene pga (I is located within an about 
3300 bp long Hindi restriction fragment. pGW1756 is deposited with the DSM^ 

The promoter region of the A. niger NW756 pga II extends before nucleotide 889 in the DNA sequence 

40 with SEQ ID NO. 3 given in the sequence listing. The coding region for the leader peptide of prepro-PGlt 
presumably extends from position 890 up to 970 whereas the signal peptide is encoded by nucleotides 890 
to 952. The coding region for the mature protein starts with nucleotide 971, ends with the stop codon at 
position 2028 and contains presumably one intron. The coding region is followed by the transcriptional 
terminator region which starts at position 2031 . 

45 Plasmid pGW1910 (Figure 7) consists of an approximately 7800 bp long insert derived from XPG-C20 
and of the vector pUG9. Within the A. niger N400 insert the polygalacturonase gene pga C Is located. The 
DNA sequence of nearly the entire pga C gene is given in the sequence listing under SEQ ID NO. 4. 

The promoter of the A, niger N400 pga C extends before nucleotide 663. The leader peptide of the 
prepro-PG encoded by pga 0 extends presumably from nucleotide 663 up to 782 and includes the DNA 

50 sequence from 663 up to 710 which codes for the signal peptide. The mature PG is encoded by the 
sequence extending from nucleotide 783 up to 1995 and including several introns. The transcriptional 
terminator region starts after the stop codon at position 1999. 

Derivatives of hybridizing DNA sequences encompassed by the present invention comprise the same 
promoter, signal and/or transcriptional terminator sequence as in pGW1800 or as in pGW1900, or another 

55 promoter, signal and/or transcriptional terminator sequence which may be derived from another gene of the 
polygalacturonase gene family. Furthermore, any other promoter, signal and/or transcriptional terminator 
sequence may be attached to the hybridizing DNA sequences depending on the host in which the desired 
PG is to be expressed. 

7 
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A wide variety of promoters signal sequences and transcriptional terminators may be employed. 
Promoters are commonly available from 5 noncoding regions of prokaryotic. eukaryotic or viral genes, 
whereas transcriptional terminators are available from 3 non-coding regions. Signal sequences can be 
obtained from the 5 coding region of preproteins which are introduced into the secretory pathway of the 

5 cell. Examples for promoters are given hereinafter. 

A signal sequence in context with the present Invention is a D.NA sequence encoding a signal peptide 
or a leader peptide of a prepro-PG of the invention. 

Flanking sequences within the meaning of the invention are also DNA fragments which are not derived 
from sequences flanking a PG gene in the genome. Such flanking sequences, for example, have regulatory 

70 functions, e.g. promoter function, or encode a polypeptide, e.g. a structural gene, or are linkers which put 
regulatory sequences and structural gene into the correct reading frame or distance, or sequences derived 
from a vector, e.g. a phage or plasmid, used injhe construction of an expression plasmid. Such flanking 
sequences may give rise to expression cassettes or fusion genes. 

The term "fragment" when used in connection with the novel DNA is intended to include also fragments 

75 of the larger derivatives, particularly those retaining promoter, signal, structural or transcriptional terminator 
functions. They may extend between two restriction sites. 

Preferred DNA fragments are those containing a promoter region, encoding a leader or signal peptide of 
a prepro-PG, or encoding a polypeptide with polygalacturonase activity, or containing a transcriptional 
terminator region. Accordingly, a DNA sequence of the invention is also such containing any combination of 

20 said fragments, e.g. those containing a promoter, and/or a coding region for a leader or signal peptide 
and/or for a PG and/or a transcriptional terminator, and the like. 

A fragment comprising a PG promoter region extends in the DNA sequence before a prepro-PG 
structural gene up to about 2000, preferably up to about 500 to 1400 nucleotides upstream. A promoter 
region binds RNA polymerase as well as regulatory proteins. 

25 A fragment comprising a promoter region of the PGI gene pga I extends, for example, from position 1 
up to position 909 of the sequence with the SEQ ID NO. 1. A fragment comprising a promoter region of the 
PGIi gene pga II of A niger N400 extends, for example, from position 1 up to position 1356 of the sequence 
with the SEQlD N072. In the A. niger NW756 insert of pGW1756 a fragment comprising a promoter region 
of pgall extends, for example, from position 1 up to 889 of the sequence with SEQ ID NO. 3. In the A. niger 

30 insert of pGW1910, a fragment comprising a promoter region of pga C extends from position 1 to 662^of the 
sequence with SEQ ID NO. 4. " 

A fragment comprising the DNA sequence coding for the leader peptide of a prepro-PG extends for 
example between the end of the promoter and the beginning of the sequence coding for the mature protein. 
A coding region for a signal peptide is shorter than for the corresponding leader peptide. It is also starting 

35 at the end of the promoter and extends up to a coding region for a signal peptidase cleavage site. In the A. 
niger N400 prepro-PGII, the leader peptide comprises 27 amino acids. A DNA fragment encoding the leader 
peptide extends for example from the ATG codon at position 1357 up to the Xhol cleaving site at position 
1429 of the DNA sequence with SEQ ID NO. 2. In prepro-PGI the leader peptide has 31 amino acids. A 
DNA fragment coding for this leader peptide extends for example between positions 910 and 1002 of the 

40 DNA sequence with the SEQ ID NO. 1. The leader peptide of the A. niger NW756 prepro-PGII has 
presumably 27 amino acids and thus the DNA fragment encoding the leader peptide is the 81 -bp long DNA 
fragment extending between position 889 and 971 in the DNA sequence with SEQ ID NO. 3. The DNA 
fragment encoding the probably 40 amino acid leader peptide of the pga G product of A. niger N400 
extends from base position 653 up to 782 of the sequence with SEQ ID NOTi. 

45 The coding regions for the respective signal peptides are. for example, located on following DNA 
fragments: Base positions 911 to 963 in SEQ ID NO. 1 1358 to 1419 in SEQ ID NO. 2. 890 to 952 in SEQ 
ID NO. 3 and 663 to 710 in SEQ ID NO. 4. 

A fragment comprising the entire coding region for mature PGI extends, for example, from position 
1003 up to position 2127 of the sequence with SEQ ID NO. 1. A fragment comprising the entire coding 

50 region for mature A. niger N400 PGII extends, for example, from position 1430 up to 2497 of the sequence 
with SEQ ID NO. 27 Mature A. niger NW756 PGII is encoded, for example, by the DNA fragment extending 
from position 953 to 2027 oflhe sequence with SEQ ID NO. 3 and mature pga C gene product is encoded, 
for example, by the DNA fragment located between bases No. 782 and 1996lDf the sequence with SEQ ID 
NO. 4. However, also shorter fragments may encode polypeptides retaining PG activity. 

55 The structural genes of PGII or PGI as well as of other PGs may contain introns like many fungal 
genes. However, included within the scope of the present invention are also intron-free structural genes of 
PGs. e.g. such which do not contain introns as genomic DNA sequence or also such which can be obtained 
from cDNA. 
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A fragment comprising a transcriptional terminator region extends, for example, from a stop codon. e.g. 
TAA, TAG or TGA, at the end of the coding region for a PG, up to about 300 to 2000, preferentially up to 
about 500 to 1000 base pairs in the downstream direction. 

Such a fragment, for example, extends from base position 2131 up to 2495 of the sequence with SEQ 
5 ID NO. 1, from 2498 to 3031 in the sequence with SEQ ID NO. 2. from 2031 to 3047 in the sequence with 
SEQ ID NO. 3, and from 1999 to 2304 in the sequence with SEQ 1D,N0.4. 

However, the fragments mentioned hereinbefore may be extended by the naturally 5 flanking se- 
quences or they may be shortened at the 5 or 3' end and nevertheless can retain promoter or 
transcriptional terminator activity, or the polypeptides encoded can retain signal peptide or galacturonase 
70 activity. Such fragments are also included within the scope of the invention. 

The fragments hereinbefore may contain or may be flanked by linkers which provide for successful 
linkage to other DNA molecules and/or put DNA fragments having regulatory functions or encoding a 
polypeptide, e.g. promoter and structural gene, into the correct reading frame or distance. 

Suitable linkers to above fragments have a DNA sequence which fits Into the restriction site of the DNA 
75 to which the fragment is to be linked. They may contain a predetermined restriction site. 

Mutants of a DNA sequence of the invention are e.g. naturally occurring mutants. The invention 
comprises also natural or synthetic mutants of the coding region for the signal or leader peptides with a 
similar or identical hydrophobicity profile, e.g. wherein the codons for the polar amino acids histidine (H*"), 
tyrosine C^') and argintne (R*^^) are exchanged by codons for other amino acids having similar charges, and 
, 20 the hydrophobic amino acids alanine (A), leucine (L) and threonine. (T), are replaced by codons for other 
hydrophobic amino acids. For example, the codon for the positively charged lysine may be replaced by a 
codon for arginlne and vice versa, the codon for the negatively charged tyrosine by a codon for glutamate 
or aspartate, and/or the codon for the non-polar, hydrophobic alanine by any one of the codons for 
threonine, proline, valine, isoleucine. leucine, methionine or phenylalanine, and the like. 
25 A recombinant DNA molecule of the invention also comprises DNA sequences which are degenerated 
within the meaning of the genetic code in that an unlimited number of nucleotides are replaced by other 
nucleotides without changing the amino acid sequence for which they code. Such degenerate DNA 
sequences may be useful because of their different restriction sites or because of preferred codon usage in 
a particular host. 

30 A recombinant DNA molecule of the present invention preferentially comprises the Aspergillus niger 
DNA insert of a hybrid vector selected from the group of hybrid vectors consisting of pGW1800. pGW l"90a 
pGW 1756. pGVV 1910, \PG-A9. \PG-A10. XPG-A43, \PG-B4. XPG-B35. XPG-B36. XPG-Cl, XPG-C16, XPG- 
C20. XPG-C37, XPG-D8. XPG-D11. XPG-D12, XPG-E6, XPG-E38, XPG-F5, XPG-G17, XPG-G27. XPG-X31 and 
XPG-Y33 or a fragment thereof comprising a promoter region of a PG gene, or a coding region for a signal 

35 peptide, leader peptide, prepro-PG, PG or a fragment of a 

PG with polygalacturonase activity, or a transcriptional terminator region of a PG gene. The inserts 
themselves are also covered by the present invention. 

The invention also concerns recombinant DNA molecules which are expression cassettes comprising a 
promoter region, a DNA fragment encoding a leader or signal peptide, a structural gene and/or a 

40 transcriptional terminator region of the invention, preferentially such derived from a vector selected from the 
group of vectors mentioned hereinbefore. 

The term "expression cassette" means a DNA sequence capable of expressing a polypeptide and 
comprises a promoter, if desired a signal sequence, further a structural gene, if desired, a transcriptional 
terminator and optionally a transcriptional enhancer, ribosomal binding site and/or further regulatory 

45 sequences. 

In an expression cassette of the invention, the PG gene derived functional fragments may be combined 
with functional fragments derived from other genes. 

A wide variety of promoter sequences may be employed, depending on the nature of the host cell. 
Promoters that are strong and at the same time well regulated are the most useful. Sequences for the 
50 initiation of translation are for example Shine-Dalgarno sequences. Sequences necessary for the initiation 
and termination of transcription and for stabilizing the mRNA are commonly available from the noncoding 
5 -regions and 3 -regions, respectively, of viral or eukaryotic cDNAS, e.g. from the expression host. 

Examples for promoters are XPl. XPr. E. coli lac, trp, tac, yeast TRP1-. ADHI-. ADHH-, PH03-, PH05-, 
or glycolytic promoters such as the promoter of the enolase, giyceraldehyde-3-phosphate dehydrogenase. 
55 3-phosphoglycerate kinase (PGK), hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, 
phosphoglucose isomerase and glucokinase genes, or promoters derived from eukaryotic viruses, e.g. 
SV40, Rous sarcoma virus, adenovirus 2, bovine papilloma virus, papovavirus, cytomegalovirus derived 
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promoters or mammalian cell derived promoters, e.g. of the actio, collagen, myosin, or jS-globin gene. The 
eukaryotic promoters may be combined with enhancing sequences such as the yeast upstream activating 
sequences (UAS) or viral or cellular enhancers such as the cytomegalovirus IE enhancers, SV40 enhancer, 
immunoglobulin gene enhancer or others. 

5 Enhancers are transcription-stimulating DNA sequences, e.g. derived from viruses such as Simian virus, 

polyoma virus, bovine papilloma virus or Moloney sarcoma virus, or of genomic origin. An enhancer 
sequence may also be derived from the e)ctrachromosomal ribosomal DNA of Physarum polycephalum 
(PCT/EP 8500278), or it may be the upstream activation site from the acid phosphatase PH05 gene (EP 
Appl. No. 86 1-1 1 820-6), or the PH05, trp, PH05-GAPDH hybrid (EP Appl. No, 86 111 820.6), or the like 

10 promoter. 

Signal sequences may be, for example, a presequence or secretory leader directing the secretion of the 
polypeptide, or the like. A signal sequence is, for example, a signal or leader peptide of prepro-PGI, prepro- 
PGII or prepro-PGC. Further signal sequences are known from literature, e.g. those compiled in von Heijne, 
G.. Nucleic Acids Res. 14 ,4683 (1986). 

75 Structural genes in~this context are, besides those coding for PGI or PGM, also those coding for other 
PGs or derivatives, in particular fragments, thereof having PG activity or other Aspergillus genes and 
structural genes which originate from viruses, procaryotic cells or eucaryotic cells and which may be 
derived from genomic DNA or from cDNA prepared via the mRNA route or may be synthesized chemically, 
coding for a wide variety of useful polypeptides, including glycosylated polypeptides, in particular of higher 

20 eukaryotic, especially mammalian, such as animal or especially human origin, such as enzymes which can 
be used, for example, for the production of nutrients and for performing enzymatic reactions in chemistry, 
or polypeptides, which are useful and valuable for the treatment of human and animal diseases or for the 
prevention thereof, for example, hormones, polypeptides with immunomodulatory, anti-viral and anti-tumor 
properties, antibodies, viral antigens, vaccines, clotting factors, foodstuffs and the like. 

25 Examples of such structural genes are e.g. those coding for hormones such as secretin, thymosin, 
reiaxin, calcitonin, luteinizing hormone, parathyroid hormone, adrenocorticotropin. melanoycte-stimuiating 
hormone, jS-lipotropin, urogastrone or insulin, growth factors, such as epidermal growth factor, insulin-like 
growth factor (IGF), e.g. IGF-I and IGF-II, mast cell growth factor, nerve growth factor, giia derived nerve cell 
growth factor, or transforming growth factor (TGF), such as TGF;3. growth hormones, such as human or 

30 bovine grov/th hormones, interleukin, such as interleukin- 1 or -2, human macrophage migration inhibitory 
factor (MIF), interferons, such as human a-interferon, for example interferon-aA, aB, aD or aF, ,i3-interferon, 
x-interferon or a hybrid interferon, for example an aA-aD- or an aB-aD-hybrid interferon, especially the 
hybrid interferon BDBB. proteinase inhibitors such as ai -antitrypsin, SLPI and the iinke, hepatitis virus 
antigens, such as hepatitis B virus surface or core antigen or hepatitis A virus antigen, or hepatitis nonA- 

35 nonB antigen, plasminogen activators, such as tissue plasminogen activator or urokinase, tumour necrosis 
factor, somatostatin, renin, jS-endorphin, immunoglobulins, such as the light and/or heavy chains of 
immunoglobulin D, E or G, or human-mouse hybrid immunoglobulins, immunoglobulin binding factors, such 
as immunoglobulin E binding factor, calcitonin, human calcitonin-related peptide, blood clotting factors, such 
as factor IX or Vlllc, erythropoietin, egiin, such as eglin C, hirudin, desulfatohirudin, such as desulfatohirudin 

40 variant HV1, HV2 or PA, human superoxide dismutase. viral thymidin kinase, ;S-!actamase, glucose 
isomerase. Preferred genes are those coding for a human a-tnterferon or hybrid interferon, particularly 
hybrid interferon BDBB. human tissue plasminogen activator (t-PA), hepatitis B virus surface antigen 
(HBVsAg), insuiin-like growth factor I and II, eglin C and desulfatohirudin. e.g. variant HV1 . In the hybrid 
vectors of the present invention, the present promoter and/or signal sequence is operably linked to the 

45 polypeptide coding region so as to ensure effective expression of the polypeptide. 

The invention also concerns recombinant DNA molecules which are recombinant vectors, alternatively 
called hybrid vectors, which comprise a DNA sequence selected from 

a) the Aspergillus niger DNA insert of pGWISOO or pGW1900, 

b) a DNA sequence which hybridizes to the coding region for the mature PGII or PGI comprised by a 
50 DNA insert of a), 

c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence of a), b) or c) which codes for a mature PG or a signal peptide or a leader peptide 
thereof and which is degenerate within the meaning of the genetic code. 

They are usable for cloning and/or expression in hosts, such as bacteria, fungi or animal cells. 
55 Such hybrid vectors may be derived from any vector useful in the art of genetic engineering, such as 
from viruses, phages, cosmids, ptasmids or chromosomal DNA, such as derivatives of SV40, Herpes- 
viruses, Papilloma viruses, Retroviruses, Baculovirus, phage X, e.g. NM 989 or EMBL4, or phage Ml 3. e.g. 
M13mp8 phage DNA (ref. 15) linearized by BamHI digestion, bacterial plasmids, e.g. pBR322, pUCl8, or 
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yeast piasmids, e.g. yeast 2u plasmid. or aiso chromosomal DNA. derived e.g. from filamentous fungi such 
as Aspergillus spec e.g. A. niger , for example those provided by EP 184 438, or a defective virus, phage 
or plasmid in the presence~bf a helper virus, phage or plasmid allowing replication of said defective virus, 
phage or plasmid, e.g. M13{ + )KS vector in presence of e.g. M14K07 helper phage. 

5 The hybrid vectors of the invention provide for replication and optionally expression of a desired DNA in 
a suitable host, either as an extrachromosomal element or by integration in the host chromosome. Several 
possible vector systems are available for integration and expression of the cloned DNA of the invention, in 
principle, all vectors which replicate and/or express a desired polypeptide gene comprised in an expression 
cassette of the invention in the chosen host are suitable. The vector is selected depending on the host cells 

10 envisaged for transformation. In general, such host cells may be prokaryotic or eukaryotic microorganisms 
such as bacteria, fungi such as yeasts or filamentous fungi, or cells of higher eukaryotic origin such as 
vertebrate, for example mammalian, cells. Suitable host cells will be discussed in detail hereinbelow. In 
principle, the hybrid vectors of the invention comprise the DNA as defined hereinbefore, an origin of 
replication or an autonomously replicating sequence, dominant marker sequences, optionally expression 

75 control sequences essential for the transcription and translation of the desired DNA and, optionally, for the 
secretion of the desired product and, optionally, additional restriction sites. 

An origin of replication or an autonomously replicating sequence (a DNA element which confers 
autonomously replicating capabilities, to extrachromosomal elements) is provided either by construction of 
the vector to include an exogeneous origin such as derived from Simian virus (SV 40) or another viral 

20 source, or by the host cell chromosomal mechanisms. 

A hybrid vector of the invention molecules of the present invention may contain selective markers 
depending on the host which is to be transformed, selected and cloned. Any marker gene can be used 
which facilitates the selection of transformants due to the phenotypic expression of the marker. Suitable 
markers are particularly those expressing antibiotic resistance, e.g. against tetracycline or ampicillin, or, in 

25 the case of auxotrophic fungal mutants, genes which complement host lesions. Corresponding,. genes 
confer, for example, resistance to the antibiotic cycloheximide, or provide for prototrophy in an auxotrophic 
yeast mutant, for example the ura3 , Ieu2 , his3 or trp1 gene. It is also possible to employ as markers 
structural genes which are associated with an autonomously replicating segment providing that the host to 
be transformed Is auxotrophic for the product expressed by the marker. 

30 Of particular importance are marker genes which complement A. niger host lesions, such as the arg B 
gene coding for the ornithine carbamoyl transferase, e.g. derived from A. niger or A. nidulans (EP 184~438), 
or A. nidulans DNA fragments homologous to the N. crassa pyr4 gene (ref. 27). 

Preferred embodiments of the present invention are hybrid vectors comprising an expression cassette 
of the invention, e.g. such comprising the structural gene and/or the promoter and/or the DNA sequence 

35 coding for a leader or signal peptide and/or the transcriptional terminator derived from a PG gene of the 
invention. 

Examples of hybrid vectors comprising an expression cassette of the invention are pGW1800, 
PGW1900, pGW1910, pGII-IFN AM119, pGtlss-IFN AM119, pGW1756. XPG-A10. XPG-A43. XPG-D8 and 
\PG-D11. 

40 Subject of the invention is not only a recombinant DNA molecule comprising a PG gene derived insert 
defined hereinbefore, but also a DNA molecule comprising a PG gene or a functional fragment thereof, e.g. 
such comprising promoter, coding region for signal peptide, leader peptide or a protein with PG activity or 
transcriptional terminator region itself which can be isolated from a PG producing fungal strain, for example, 
from Aspergillus spec .. e.g. A. iaponicus . A. orysae , A. nidulans . A. niger , Trichoderma spec Botrytis 

45 spec .. Sclerotinia spec ., Fusarium spec, or other phytopathogenic fungi as well as from yeast, for example 
from PG producing Kluyveromyces spec . such as K. fragiiis . Preferred are DNA molecules isolated from 
A. niger. 

The DNA molecules of the invention and derivatives thereof, including fragments, can be used for 
screening DNA gene libraries or mRNA for further similar DNAs or mRNAS. 

50 

Process for the Preparation of the Recombinant DNA Molecules 



Further object of the invention is a process for the preparation of a recombinant DNA molecule defined 
55 hereinbefore comprising culturing a host transformed with such a recombinant DNA molecule or preparing it 
by an in vitro synthesis. 

The culturing of the hosts is carried out in a conventional nutrient medium which may be supplemented 
with or deprived of chemical compounds allowing negative or positive selection of the transformants, i.e. 
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such hosts containing the desired DNA molecuie together with a selection marker, from the non- 
transformants, i.e. such hosts lacking the desired DNA molecule. 

Any transformable hosts useful in the art may be used, e.g. bacteria, such as E. coli . fungi, such as 
Saccharomyces cerevisiae , Kluyveromyces lactis , or in particular filamentous fungi, such as Aspergillus , 
5 e.g. A. nidulans , A. oryzae , A. carbonarius , A. awamori and especially A. niger . A preferred host is A. 
niger^AnS , a mutant lacking thi pyr A gene, aFdescribed further below, or A. niger N593. Transformation 
of the hosts is carried out by conventional methods. 

A DNA sequence comprised in a recombinant DNA molecule of the invention can be obtained from the 
genome of a fungus expressing polygalacturonase, or can be prepared for example by culturing a host 
70 which is transformed with a recombinant DNA molecule of the invention and. when required, isolating the 
desired DNA sequence therefrom, or by chemical synthesis through nucleotide condensation. 
In particular, such DNAs can be prepared by 
a) isolating genomic DNA from suitable fungal cells, and selecting the desired DNA, e.g. using a DNA 
probe or using a suitable expression system and screening for expression of the desired polypeptide, or 
75 b) isolating mRNA from suitable fungal cells, selecting the desired mRNA, e.g. by hybridization with a 
DNA probe or by expression in a suitable expression system and screening for expression of the desired 
polypeptide, preparing single-stranded cDNA complementary to that mRNA, then double-stranded cDNA 
therefrom, or 

c) isolating cDNA from a cDNA library and selecting the desired cDNA. e.g. using a DNA probe or using 
20 a suitable expression system and screening for expression of the desired polypeptide, or/and 

d) incorporating the double-stranded DNA of step a), b) or c) into an appropriate vector. 

e) transforming appropriate host cells with the obtained hybrid vector. 

i) selecting transformed host cells which contain the desired DNA from untransformed host cells and 
m uiti pi i eating the transformed host cells, and, when required, 

25 g) isolating the desired DNA and/or converting the DNA into a mutant or fragment thereof. 

Genomic DNA may be isolated and screened for the desired DNA (step a). Genomic DNA is isolated 
from suitable fungal strain expressing proteins with PG activity. A genomic DNA library is prepared 
tiierefrom by digestion with suitable restriction endonucleases and incorporation into suitable vectors 
following established procedures. The genomic DNA library is screened with a DNA probe as described 

30 hereinafter, or expressed in a suitable expression system and the obtained polypeptides screened in 
conventional manner, A detailed description of the isolation of a DNA molecule of the invention from a 
genomic library is given hereinafter. 

Polyadenylated messenger RNA (step b) is isolated from the suitable cells, by known methods. Isolation 
methods involve, for example, homogenizing in the presence of a detergent and a ribonuclease inhibitor, 

55 e.g. heparin, guanidinium isothiocyanate or mercaptoethanol, extracting the mRNA v/ith suitable chloroform- 
phenol mixtures, optionally in the presence of salt and buffer solutions, detergents and/or cation chelating 
agents, and precipitating mRNA from the remaining aqueous, salt-containing phase with ethanol, 
isopropanol or the like. The isolated mRNA may be further purified by centrifuging in a cesium chloride 
gradient followed by ethanol precipitation and/or by chromatographic methods, e.g. affinity chromatography, 

40 for example chromatography on oiigo(dT) cellulose or on o}igo(U) sepharose. Preferably, such purified total 
mRNA is fractionated according to size by gradient centrifugation, e.g. in a linear sucrose gradient or 
chromatography on suitable size fractionation columns, e.g. on agarose gels. 

The desired mRNA is selected by screening the mRNA directly with a DNA probe, or by translation in 
suitable cells or cell-free systems and screening the obtained polypeptides. 

45 The selection of the desired mRNA is preferably achieved using a DNA hybridization probe, thereby 
avoiding the additional step of translation. Suitable DNA probes are DNAs of known nucleotide sequence 
consisting of at least 17 nucleotides, for example synthetic DNAs. cDNAs derived from mRNA coding for 
the desired polypeptides, or genomic DNA fragments comprising e.g. adjacent DNA sequences which are. 
isolated from a natural source or from a genetically engineered microorganism. 

50 Synthetic DNA probes are synthesized according to known methods as detailed hereinbelqw. preferably 
by stepwise condensation using the solid phase phosphotriester. phosphite triester or phosphoramidite 
method, e.g. the condensation of dinucleotide coupling units by the phosphotriester method. ■ These 
methods are adapted to the synthesis of mixtures of the desired oligonucleotides by using mixtures of two, 
three or four nucleotides dA. dC. dG and/or dT in protected form or the corresponding dinucleotide coupling 

55 units in the appropriate condensation step as described by Y, Ike et al. (Nucleic Acids Research 1J_ , 
477.1983). 

For hybridization, the DNA probes are labelled, e.g. radioacttvely labelled by the well known kinase 
reaction. The hybridization of the size-fractionated mRNA with the DNA probes containing a label is 
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performed according to known procedures, i.e. in buffer and salt solutions containing adjuncts, e.g. calcium 
chelators, viscosity regulating compounds, proteins, non-homologous DNA and the like, at temperatures 
favoring selective hybridization, e.g. between 0°C and 80 *C, for example between 25 *C and 50 'C or 
around 65" C. preferably at around 20* lov/er than the hybrid double-stranded DNA melting temperature. 

5 Fractionated mRNA may be translated in cells, e.g. frog oocytes, or in celt-free systems, e.g. in 
reticulocyte lysates or wheat germ extracts. The obtained polypeptides are screened for enzymatic activity 
or for reaction with antibodies raised against the native polypeptide, e.g. in an immunoassay, for example 
radioimmunoassay, enzyme immnoassay or immunoassay with fluorescent markers. Such immunoassays 
and the preparation of polyclonal and monoclonal antibodies are well known in the art and are applied 

10 accordingly. 

The preparation of a single-stranded complementary DNA (cDNA) from the selected mRNA template is 
well known in the art, as is the preparation of a double-stranded DNA from a single-stranded DNA. The 
mRNA template is incubated with a mixture of deoxynucleoside triphosphates, optionally radioactively ^ 
labelled deoxynucleoside triphosphates (in order to be able to screen the result of the reaction), a primer 

15 sequence such as an oligo-dT residue hybridizing with the poly(A) tail of the mRNA and a suitable enzyme 
such as a reverse transcriptase e.g. from avian myeloblastosis virus (AMV). After degradation of the 
template mRNA e.g. by alkaline hydrolysis, the cDNA is incubated with a mixture of deoxynucleoside 
triphosphates and a suitable enzyme to give a double-stranded DNA. Suitable enzymes are for instance a 
reverse transcriptase, the Klenow fragment of E. coli DNA polymerase ! or T4 DNA polymerase. Usually, a 
. '20 hairpin loop stucture formed spontaneously byTheTingle-stranded cDNA acts as a primer for the synthesis 
of the second strand. This hairpin structure is removed by digestion with 81 nuclease. Alternatively, the 3'- 
end of the single-stranded DNA is first extended by homopolymeric deoxynucleotide tails prior to the 
hydrolysis of the mRNA template and the subsequent synthesis of the second cDNA strand. 

In the alternative, double-stranded cDNA is isolated from a cDNA library and screened for the desired 

25 cDNA (step c). The cDNA library is constructed by isolating mRNA from suitable cells, and preparing 
single-stranded and double-stranded cDNA therefrom as described above. This cDNA is digested with 
suitable resctriction endonucleases and incorporated Into X phage, e.g. X charon 4A or X gt11 following 
established procedures. The cDNA library replicated on nitrocellulose membranes is screened by using a 
DNA probe as described hereinbefore, or expressed in a suitable expression system and the obtained 

30 polypeptides screened for reaction with an antibody specific for the desired compounds. 

A variety of methods are known in the art for the incorporation of double-stranded cDNA or genomic 
DNA into an appropriate vector (step d). For example, complementary homopolymer tracts may be added 
to the double-stranded DNA and the vector DNA by incubation in the presence of the corresponding 
deoxynucleoside triphosphates and an enzyme such as terminal deoxynucleotidyl transferase. The vector 

35 and double-stranded DNA are then joined by base pairing between the complementary homopolymeric talis 
and finally ligated by specific joining enzymes such as ligases. Other possiblities are the addition of 
synthetic linkers to the termini of the double-stranded DNA, or the incorporation of the double-stranded DNA 
into the vector by blunt- or staggered-end ligation. Appropriate vectors will be discussed in detail 
hereinbelow. 

40 The transformation of appropriate host cells with the obtained hybrid vector (step e) and the selection 
and multiplication of transformed host cells (step f) are well known in the art. Examples for such methods 
are given further below. Hybrid vectors and host cells may be particularly suitable for the production of 
DNA, or else for the production of the desired polypeptides. 

The Isolation of the desired DNA, mutants and fragments therof according to the invention Is achieved 
45 by methods known in the art, e.g. extraction with phenol and/or chloroform. Optionally, the DNA can be 
further manipulated e.g. by treatment with mutagenic agents to obtain mutants, or by digestion with 
restriction enzymes to obtain fragments, modify one or both termini to facilitate incorporation into the vector, 
remove intervening sequences and the like. 

The nucleotide sequence of a DNA according to the invention can be determined by methods known 
50 per se , for example by the Maxam-Gilbert method using end-labelled DNA or by the dideoxy chain 
termination method of Sanger. 

PG gene sequences of the present invention can also be prepared by an in vitro synthesis according to 
conventional methods. The in vitro synthesis is especially applicable for the preparation of smaller 
fragments of a PG expression cassette, e.g. of the DNA sequences of a PG gene coding for a promoter or a 
55 "signal peptide, for example of the PGl, PGII or the pgaC gene, of a PG gene comprised in a DNA molecule 
defined hereinbefore, especially in the X-clones, or a mutant thereof. 

Suitable methods for the synthesis of DNA have been presented in summary form by S.A. Narang 
(Tetrahedron 39 . 3. 1983). The known synthesis techniques allow the preparation of polynucleotides up to 
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120 bases in length, in good yield, high purity and in a relatively short time. Suitably protected nucleotides 
are linked with one another by the phosphodiester method (K.L. Agarwa! et al., Angew. Chemie 84 , 489, 
1972), the more efficient phosphotriester method (C.B. Reese, Tetrahedron 34 , 3143, 1972), the phosphite 
triester method {R.L Letsinger et al„ J. Am. Chem. Soc 98 3655. 1976) or~phosphoramidite method (S.L 

5 Beaucage and M.H. Carruthers. Tetrahedron 22 , 1859^1981). Simplification of the synthesis of the 
oligonucleotides and polynucleotides is made poisibie by the solid phase method, in wich the nucleotide 
chains are bound to a suitable polymer. H. Rink et al. (Nucl. Acids Research ^2 , 6369, 1984) use 
trinucleotides instead of individual nucleotides and link them by the phosphotriester method in the solid 
phase synthesis. A polynucleotide can thus be prepared in a short time and with good yields. The actual 

70 double-stranded DNA is built up enzymatically from chemically prepared over-lapping oligonucleotides from 
both DNA strands, which are held together in the correct arrangement by base-pairing and are then 
chemically linked by the enzyme DNA tigase. 

Another possibility comprises incubating overlapping single oligonucleotides from the two DNA strands 
in the presence of the four required deoxynucieoside triphosphates with a DNA polymerase, for example 

15 DNA polymerase I, the Klenow fragment of polymerase I or T4 DNA polymerase, or with AMV (avian 
myeloblastosis virus) reverse transcriptase. The two oligonucleotides are thereby held together in the 
correct arrangement by base-pairing and are supplemented with the required nucleotides by the enzyme to 
give a complete double-stranded DNA (S.A. Narang et ai., Anal. Biochem. 121 , 356, 1982). 

In the following the preparation of a recombinant DNA molecule oPthe present invention from a 

20 genomic library and homologous and heterologous hybridization conditions for identifying members of the 
PG gene family are described in more detail. 

A genomic library can be prepared e.g. by partial digestion of genomic DNA of an A. niger strain, e.g. 
NW756 or N400, with e.g. Sau 3Ai or Mbol, and cloning the high molecular weight DNA fragments in a 
suitable host vector, e.g. the E. coli plasmid pUN121 or a lambda vector, e.g. EMBL4. 

25 Other fungal strains producing the desired PGs, for example, Aspergillus spec. , e.g. A. japonicus , A. 
oryzae , A. nidulans , A. niger . Trichoderma spec. , Botrytis spec ., Sclerotinia spec Fusarium spec, or 
other phytopathogenic fungi as well as yeast, for example PG producing Kluyveromyces spec . such as K. 
fragiiis may serve as source for the genomic library and other suitable vectors, e.g. those mentioned 
hereinbefore, may be used as recipient for the fragments. 

30 In order to successfully screen the genomic library for DNA sequences coding for PGs a hybridizing 
DNA probe is necessary. This can be a synthetic DNA probe if the amino acid sequence or part thereof of 
the desired PG is known, or another PG gene or a part thereof, which hybridizes to a desired PG gene. As 
neither a PG sequence nor a PG gene or part thereof were known prior to the invention, the problems of 
purification of PGs, structure determination of the N-terminal sequence of a PG and preparation of useful 

35 DNA probes were solved first. 

For the purification of the present PGs any source containing it may be used. For example, crude 
sources such as enzyme mixtures obtained from Aspergillus niger , such as Rapidase®. a commerciaily 
available mixture of pectinolytic enzymes, may serve as source. 

The purification follows conventional purification methods, such as salting out, desalting, reprecipitation 

40 in form of a different salt, chromatography, e.g. affinity chromatography such as on a crossiinked alginate, 
ion exchange chromatography, e.g. on a DEAE-Sephadex or Sepharose column, gel permeation chromatog- 
raphy, e.g. on a Sephacryl column, electrophoresis, e.g. with SDS-polyacrylamide gel, isoelectric focusing, 
and the like, or any combination thereof. 

in the present invention, PGi, PGII, PGillA, PGlilB and PG!V were isolated in a pure form from 

45 Rapidase®. Said PGs are endo-polygalacturonases (E.G. 3.2.1.15) with following physico-chemical prop- 
erties: PGI has a Mr of 55 K, an isoelectric point (lEP) of between 3.2 and 3,5 and a pH-optimum for the 
enzymatic activity of 4.9. The values for PGM are a Mr of 38 K. an !EP of between 4.8 and 5.9, a pH- 
optimum of 4.8; PGIIIA has a Mr of 57 K, an lEP of 3.3 and a pH-optimum of 4.3, PGIIIB has also 57 K and 
an lEP of 3.3, however, the pH-optimum is 4.5. Finally, PGIV has a Mr of 59 K, an lEP of 3.7 and a pH- 

50 optimum of 4.8. The Mr values given are determined by SDS-polyacryiamide gel electrophoresis, the lEP 
values by thin-layer isoelectric focusing. 

The N-terminal sequencing of PGI revealed following sequence: 
ala^-ser-thr-X^-thr^-phe-thr-ser-ala^, 

55 The N-terminal sequencing of a 21 kDa BrCN-fragment of PGi revealed the sequence ala-ser-thr-X*-thr- 
phe-thr-ser-ala-ser-glu, 

i.e. the N-terminal sequence of PGI, and sequencing of a 5.5 kDa BrCN-fragment revealed the sequence 
ala-asp-g!y-ala-val-ile-asp-gly-asp-gly-ser. 
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The N-ternnina! sequencing of PG17 revealed the sequence 
asp'-ser-X3-thr-phe5-thr-thr-ala-aIa-ala'°-a!a-Iys-ala^^ 
and of a BrCN-fragment of 17 kDa the sequence 
a!a-phe-ser-val-gin-ala-asn-asp-ile-thr-phe. 

and X* in the sequences at the tinne were not identified. 

Based on the amino acid sequence of the 5.5 kDa BrCN-fragment of PGI the following oligonucleotide 
mixture was synthesized 

met ala asp gly ala val 
HR 6298 5' (d) ATG GCI GARi GGI GCI GTI 

ile asp gly asp gly 

ATR3 GARi GGI GARi GG 3' 

Based on the amino acid sequence of the 17 kDa BrCN-fragment of PGII the following two 
oligonucleotide mixtures were synthesized 



met ala phe ser val gin ala 

HR6195 ,5' (d) ATG, GCI TTRi TCI GTI CAR2 GCI 

HR6196 5' (d) ATG GCI TTRi AGI GTI CAR2 GCI 

asn asp ile 

30 HR6195 AARi GARi AT 3' 

HR619 6 AARi GAR^ AT 3' 



In the sequences of the oligonucleotides, I is inosine. Ri is T or 0, R2 is A or G and R3 is T, 0 or A. 

The mixtures were radioactively marked and used to screen a genomic library of Aspergillus niger N400 
for PGI and PGII genes, respectively. ■ 

The identified genes or gene fragments are then sequenced according to conventional methods. The 
sequences of the PGI and PGII genes of A. niger N400 are represented by the sequences with SEQ ID NO. 
1 and 2, respectively, depicted in the sequence listing. 

For screening purposes the DNA probes are radioactively labelled by methods known in the art. e.g. 
using y-^P-ATP and T4 kinase or a^^p-dATP and E. coli DNA polymerase I, depending on the type of probe 
used. Host microorganisms carrying nucleic acids of the present invention as an insert, are identified by 
hybridization with the labelled DNA probe on filter replicas of the gene library. 

Clones showing a hybridization response to one or more DNA probes are isolated and amplified. 

The hybridization conditions used are conventional and can be more or less stringent. 

Stringent conditions are such under which only DNA sequences with a high degree in homology can 
hybridize ("homologous" hybridization conditions). Under "heterologous" or less stringent conditions, also 
related DNA sequences with a lower degree in homology can hybridize. The stringency of the hybridization 
is influenced, for example, by the hybridization and washing temperature, formamide or salt concentration, 
G + C content of the DNA, length of the hybridization time and length of the DNA probe. Illustrative but not 
limiting examples of "homologous" and "heterologous" hybridization conditions are given hereinafter in the 
Examples. 

With the use of one or more DNA probes derived from the PGI or PGII gene, particularly such 
comprising at least part of the coding regions thereof, hybridizing clones derived from a genomic library, 
e.g. of A. niger , preferentially of A. niger N400 or A. niger NW 756, can be identified and divided into 
different classes based on their degree of homology and on the basis of the hybridizing restriction 
fragments observed. Examples of preferred clones derived from a A. niger N400 library are XPG-A9, -A10, 
-A43. -B4, -B35, -B36, -C1. -016, -C20. -037. -D8. -D11. -D12, -Eer-EasT-FS. -G17, -G27, -X31 and -Y33. 
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Their preparation is described in detail hereinafter in the Examples. An Example for a preferred clone 
derived from A. niger NW756 is the plasmid pGW1756. 

These clones as well as plasmids pGW1800, pGW1803. pGWl900. pGW1902 and pGW1910 can be 
used to prepare other recombinant DNA molecules of the invention, in particular such containing fragments 
5 of the PG gene sequences comprised therein. Said other recombinant DNA molecules are prepared in 
conventional manner by applying conventional restriction enzymes, linkers, ligation, amplification and 
isolation processes. Preferentially, expression vectors are prepared which may be pro- or eukaryotic vectors 
or shuttle vectors for propagation both in pro- and eukaryotic cells. Examples for the construction of such 
shuttle vectors- are well known in the art. The expression vectors may contain homologous or heterologous 
10 genes. Examples of such genes are given hereinafter. 

The inserts of the recombinant DNA molecules of the invention or fragments thereof can be obtained in 
conventional manner, e.g. after cleaving the recombinant DNA with suitable restriction enzymes and 
isolating the desired DNA fragments after agarose gel electrophoresis. 

Mutants of the DNA molecules of the invention, particularly of the PG gene sequences, containing new 
75 restriction sites can also be prepared, for example in vitro by site-directed mutagenesis, according to 
conventional methods [see review article of M.J. Zoller and M. Smith. Methods Enzymol. 100. 468 (1983). 
D. Botstein and D. Shortie, Science 229 ,1193 (1985) or K. Norris et al.. Nucl. Acids Res. 1_l_ .5103(1983)]. 

The invention concerns also the use of the recombinant DNA molecules of the invention for the 
preparation of hybrid vectors for the expression of a structural gene. 

20 

Transformed hosts 



Furthermore, the invention concerns transformed host ceils for amplifying the recombinant DNA molecules 
25 of the invention or particularly for expressing an expression cassette comprised in a recombinant DNA 
molecule of the invention. 

Examples of suitable hosts, particularly for amplifying the recombinant DNA molecules of the invention, 
are microorganisms which are devoid of or poor in restriction enzymes or modification enzymes, such as 
bacteria, in particular strains of Escherichia coli . for example E. coli XI 776, E. coii Y1090, E. colt W3110, E. 

30 coli HB101/LM1035. E. coli JA 221, E. coli"DH5a, or preferentiaTiy"E. coli DHbaF , JM1097MHT or HBIOTT 
or^E. coli K12 strairi^ "Bacillus subtNis~~r Bacillus stearothermophiTus~T Pseudomonas , Haemophilus , 
Streptococcus and others, and yeasts, for example Saccharomyces cerevisiae such as S. cerevisiae GRF 
18. Further suitable host cells are cells of higher organisms, in particular established continuous human or 
animal cell lines, e.g. human embryonic lung fibroblasts LI 32. human malignant melanoma Bowes cells, 

35 HeLa ceils. SV40 virus transformed kidney cells of African green monkey COS-7 or Chinese hamster ovary 
(CHO) cells. 

Examples of suitable hosts for expressing an expression cassette of the invention are the cells 
mentioned hereinbefore and in particular filamentous fungi, for example Penicillium, Cephalosporium or 
preferentially Aspergillus spec. e.g. A. carbonarius , A. awamori or preferentially A. niger , A. nidulans or A. 
40 oryzae . 

Preferred transformed hosts are E. coH MH1 transformed with pGW1800 or pGW1803, E. coli JM109 
transformed with pGWISOO or pGW1803. E. coli DH5aF' transformed with pGW1900, 1902. 1756 or 1910. 
pGll-IFN AM1 19 or pGllss-IFN AM1 19. Aspergillus niger An8 or N593 or Aspergillus nidulans transformed 
with pGII-IFN AM119 or pGllss-lFN AM119 and optionally with the selection marker plasmid pCG59D7. 

45 The invention concerns also a method for the preparation of such transformants comprising treatment of 
a suitable host cell under transforming conditions with a recombinant DNA molecule of the present 
invention, especially a hybrid vector of the invention, optionally together with a selection marker gene and 
optionally selecting the transformants. 

Transformation of microorganisms is carried out according to conventional methods as described in the 

50 literature, for example for S. cerevisiae (A. Hinnen et al.. Proc.Natl.Acad.Sci.USA. 75 , 1929,1978), for B. 
subtilis (Anagnostopoulos eTal., J. Bacteriol. 81_ , 741, 1961), and for E. coii (M. Mandel et ai.. J. Mol. BioT 
53 , 159, 1970). ~~ 

Accordingly, the transformation procedure of E. coli cells includes, for example, Ca^ pretreatment of 
the cells so as to allow DNA uptake, and incubation with the hybrid vector. The subsequent selection of the 

55 transformed cells can be achieved, for example, by transferring the cells to a selective growth medium 
which allows separation of the transformed cells from the parent cells dependent on the nature of the 
marker sequence of the vector DNA. Preferably, a growth medium is used which does not allow growth of 
cells which do not contain the vector. The transformation of yeast comprises, for example, steps of 
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enzymatic removal of the yeast cell wall by means of gfucosidases, treatment of the obtained spheroplasts 
with the vector in the presence of polyethylene glycol and Ca^ ions, and regeneration of the cell wall by 
embedding the spheroplasts into agar. Preferably, the regeneration- agar is prepared in a way to allow 
regeneration and selection of the transformed cells as described above at the same time. 
5 Transformation of cells of higher eukaryotic origin, such as mammalian cell lines, is preferably achieved 

by transfection. Transfection is carried out by conventional techniques, such as calcium phosphate 
precipitation, microinjection, protoplast fusion, electroporation, i.e. introduction of DNA by a short electrical 
pulse which transiently increases the permeability of the cell membrane, or in the presence of helper 
compounds such as diethylaminoethyldextran, dimethyl sulfoxide, glycerol or polyethylene glycol, and the 

10 like. After the transfection procedure, transfected cells are identified and selected e.g. by cultivation in a 
selective medium chosen depending on the nature of the selection marker, for example standard culture 
media such as Dulbecco's modified Eagle medium (DMEM), minimum essential medium, RPMI 1640 
medium and the like, containing e.g. the corresponding antibiotic. 

The transformed host cells are cultured by methods known in the art in a liquid medium containing 

75 assimilable sources of carbon, e.g. carbohydrates such as glucose or lactose, nitrogen, e.g. amino acids, 
peptides, proteins or their degradation products such as peptones, ammonium salts or the like, and 
inorganic salts, e.g. sulfates, phosphates and/or carbonates of sodium, potassium, magnesium and calcium. 
The medium furthermore contains, for example, growth-promoting substances, such as trace elements, for 
example iron, zinc, manganese and the like. 

20 The medium is preferably so chosen as to exert a selection pressure and prevent the growth of ceils 
which have not been transformed or have lost the hybrid vector. Thus, for example, an antibiotic is added to 
the medium if the hybrid vector contains an antibiotic resistance gene as marker. If, for instance, a host cell 
is used which is auxotrophic in an essential amino acid whereas the hybrid vector contains a gene coding 
for an enzyme which complements the host defect, a minimal medium deficient of said amino acid is used 

25 to culture the transformed cells. 

Ceils of higher eukaryotic origin such as mammalian cells are grown under tissue culture conditions 
using commercially available media, for example Dulbecco's modified Eagle medium (DMEM). minimum 
essential medium, RPMI 1640 medium and the like as mentioned above, optionally supplemented with 
growth-promoting substances and/or mammalian sera. Techniques for cell cultivation under tissue culture 

30 condition are v/el! known in the art and include homogeneous suspension culture, e.g. in an airlift reactor or 
in a continuous stirrer reactor, or immobilized ^or entrapped cell culture, e.g. in hollow fibres, microcapsules, 
on agarose microbeads, porous glass beads, ceramic cartridges, or other microcarriers. 

Culturing is effected by processes which are known in the art. The culture conditions, such as 
temperature, pH value of the medium and fermentation time, are chosen so that a maximum titer of the 

35 polypeptide or derivative of the invention is obtained. Thus, an E. coM or yeast strain is preferably cultured 
under aerobic conditions by submerged culture with shaking or stirring at a temperature of about 20 'C to 
40° C, preferably at about 30 'C, and a pH value of 4 to 8. preferably of about pH 7, for about 4 to 30 
hours, preferably until maximum yields of the polypeptide or derivative of the invention are reached. 

In order to allow selection of the transformed from the nontransformed ceils, the DNA molecules of the 

40 invention carry a selection marker or, alternatively, the cells are cotransformed with a second vector 
containing such marker. As in other systems such selection marker is an expressible, structural gene, the 
expressed polypeptide of which (an enzyme) provides resistance against compounds toxic to the receipt 
organism or which completes the enzyme system of a mutant lacking such essential polypeptide. Such 
marker genes suitable for selection of transformed filamentous fungal cells are, for example, the known qa-2 

45 , pyrG , pyr4 , trpC , amdS or argB genes. 

As described in EP 278.355 a marker gene, named pyrA , was isolated from the genomic library of A. 
niger , which is related to and has similar function as pyr G of A. nidulans and pyr4 of N. crassa , namely 
producing the enzyme orotidine s'-phosphate decarboxTlise. This enzyme catalyses the~decarboxylation of 
orotidine 5 -phosphate to uridylic acid (uridine s'-phosphate) and also of fluoro-orotic acid to the toxic 

50 fluoro-uridine. An E, coli clone containing the pyrA gene was identified by hybridization with the 1.1 kb Hind 
III fragment of pE5jB2~(ref. 25) containing part of the pyr4 gene. However, DNA of any other pyr gene 
coding for orotidine-5 -phosphate decarboxylase may be used. From a positive clone named~~E. coli 
B75183/pCG59D7, the plasmid pCG59D7, comprising the pyrA gene, was isolated and used for cotransfor^ 
mation of an A. niger pyrA" mutant. Such pyrA~ mutant is defective in the orotidine 5 -phosphate 

55 decarboxylase gene and therefore is unable to produce the corresponding enzyme. Such mutant was 
prepared by treating conidiospores of A. niger N756 under mutating UV-irradiation and colonies surviving in 
the presence of fluoro-orotic acid and~uridine are selected. Colonies surviving in the presence of fluoro- 
orotic acid and absence of uridine are eliminated. The remaining uridine-requiring mutants, according to 
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their ability of being transformable, belong to two complementation groups pyr A and pyr B, represented by 
mutants An8 and AnIO. respectively. They are treated in the form of protoplasts thereoTunder transforming 
condition with the pyr A containing plasmid pCG59D7. Only the An8 colonies were found to be transformed 
and to contain the pyr A gene as evidenced by the hybridizing ability of digested DNA thereof with DNA of 
5 pUN121. 



Preparation of polypeptides 

w The invention concerns further a method for the preparation of polypeptides, characterized in that a 
homologous or heterologous structural gene, for example such the meaning of which is given hereinbefore, 
inserted in an expression cassette of the Invention is expressed in a suitable transformed host according to 
conventional methods. When required, the polypeptide is isolated in conventional manner. Depending on 
the construction of the expression cassette, the products are either produced or. if a signal sequence is 

75 present are produced and secreted. Usually, the expression cassette is comprised in a hybrid vector, but 
may also become integrated into the host genome after transformation. 

A suitable host is for example a fungus, e.g. a filamentous fungus, particularly an Aspergillus strain, if a 
promoter derived from a PG gene of the invention or another promoter derived from filamentous fungi is 
used for the expression of a homologous or heterologous structural gene. However, if other promoters are 

20 used, for example such derived from prokaryotes or higher eukaryotes, other hosts are suitable, for example 
bacteria such as E. coli, or higher eukaryotic cells, respectively. 

The promoters of the PG genes of A. niger are inducible in the A. niger cell, i.e., the expression of the 
structural gene attached thereto, e.g. the structural gene coding for a~PG or any foreign gene, is induced by 
addition of pectin or pectin degradation products to the medium. In the presence of sufficient glucose, 

25 hov^ever, the promoter is not inducable, if an A. niger strain, e.g. An8 or N593, is used as host. This means, 
genes under the control of an A. niger PG promoter are "catabolite repressed" in A. niger . However, if 
another Aspergillus strain is used, preferentially A. oryzae or most preferentially A. nidulans , a gene under 
the control of an A. niger PG promoter is expres"sed constitutiveiy, i.e. also in the" absence of pectin and/or 
in the presence of glucose. It can therefore be advantageous to express genes under the control of an A. 

20 niger PG promoter in an Aspergillus host other than A. niger , preferentially A. oryzae or most preferentially 
A. nidulans , because, for example, glucose instead" of pectin can be added to the nutrient medium as 
energy and carbon source during the expression of a desired gene. 

If a promoter not derived from a PG gene of the present invention is used for the con struction of an 
expression cassette of the invention, e.g. comprising a structural gene encoding a PG, other hosts can be 

25 used as expression hosts. The suitable hosts depend on the promoter used and are e.g. bacteria such as E. 
coli . or yeast, such as S. cerevisiae or Kluyveromyces lactis . Suitable hosts and promoters for the 
preparation of polypeptides according to the invention are also those suitable for transformation given 
hereinbefore. 

It is now possible to overexpress one or more desired PGs, whereby various methods can be applied. A 

40 purified single PG can be prepared by subjecting a pectinoiytic enzyme mixture containing a PG to 
conventional purification methods. Another method for the production of a single PG, preferentially PGI. 
PGII or the product of the pga C gene, is characterized in that a suitable host which is not capable of 
expressing any PG or which expresses PGs in low amount or which does not express PG under the 
induction conditions used for the introduced PG gene, is transformed with a hybrid vector comprising a 

45 structural gene coding for a PG, e.g. PGI, PGH or the pgaC gene product, or a fragment of a PG with PG 
activity, and that said structural gene is expressed. If a host not capable of expressing any PG is used, the 
respective single PG can be obtained in pure form, that means uncontaminated by any other PG. It is also 
possible to produce predetermined mixtures of PGs, optionally together with other enzymes, by expressing 
in a transformed host not only a single but more than one desired PG genes, optionally together with genes 

50 encoding other, enzymes. Predetermined mixtures can also be obtained by expressing one or more desired 
genes for PG and/or other enzymes, e.g. for pectin esterases, pectin lyases, cellulases, mixed endo- 
glucanases. hemicellulases, xylanases, arabinases, galactanases, a-and ^-glycodidases, and the like, in a 
host strain, optionally such with a certain background in enzyme production. 

Predetermined mixtures of enzymes can also be obtained by disrupting particular PG genes in the host 

55 by "gene disruption", which is a technology well known in the art, using the isolated PG genes of the 
invention. 

A host not capable of expressing any PG is either a microorganism having no corresponding gene e.g. 
a PG" Aspergillus strain, another PG~ non-Aspergillus fungus or any other PG~ eukaryotic or prokaryotic 
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cell, or an Aspergillus strain, e.g. A. oryzae or A. nidulans , whose expression of endogeneous PG genes 
are suppressed in an appropriateTy conditioned" growth nnedium, e.g. are "catabolite repressed" and/or 
uninduced, whereas the exogenous PG promoter operatively linked with the desired PG structural gene, e.g. 
an A. niger derived pronaoter, is active under these conditions or where the PG gene is fused to another 
5 promoter. 

Other promoters and strains suitable for the preparation of PGs are the same as given hereinbefore in 
the description of the expression cassettes. 



10 The polypeptides and compositions 

The single PGs purified from a pectinolytic enzyme mixture, preferentially from an Aspergillus niger 
derived pectinolytic enzyme mixture, in particular from Rapidase®, and PGs encoded by a DNA sequence 
according to the present invention and derivatives thereof with PG activity, in particular if produced in a 
75 suitable host transformed with an expression hybrid vector coding for such PG or a derivative thereof with 
PG activity, and physiologically acceptable salts thereof are also subjects of the present invention. In 
particular preferred are PGI, PGII, PGIIIA, PGIIIB and PGIV of Aspergillus niger , in purified form. The 
invention concerns said polypeptides whenever produced by a method according to the present invention. 

The Invention concerns further enzymatic compositions comprising one or more of such single PG 
20 and/or a derivative thereof with PG activity and/or biologically acceptable salts thereof optionally in a 
predetermined combination with one or more suitable enzymes having other than PG activity. 

Enzymes having other than PG activity suitable for the preparation of said enzymatic compositions are 
degrading and modifying plant cell wall polymers. Such enzymes are e.g. pectin esterases, pectin lyases, 
cellulases, mixed endo-glucanases, hemicellulases, xylanases, arabinases, galactanases. a- and jS- 
25 glycodidases, and the like. 

The present invention concerns further the preparation of said enzymatic compositions in conventional 
manner. 

The use of said single PGs and/or of derivatives thereof with PG activity and/or of said enzymatic 
compositions in plant material processing are also a subject of the present invention. 
30 Single PGs of the invention and/or derivatives thereof with PG activity or enzymatic compositions 

comprising such PG or derivative are useful e.g. for clarification of vegetable or fruit juice, for the enhancing 
of the juice yield in vegetable or fruit juice production and of pressing yield of oil-containing seeds or fruits, 
for stabilization of vegetable or fruit juice, for reduction of the viscosity of vegetable or fruit juice, for the 
liquefaction of biomass, for maceration, for the enhancement of natural product extraction like natural 
35 pigments, aromas and flavours, for the valorization of biomass, food or feed, to improve the recovery of 
cellulose fibers in paper pulping, and the like. 

The most preferred embodiments of the invention are those described hereinafter in the Examples. 

^^Q"^ Description of the Figures 

Figure 1_ : Partial restriction maps of the EcoRI fragments of phages \D and XE obtained from a genomic 
library of A. niger by hybridization with the combined DNA probes HR6195 and HR6196 coding for part 
of the PGII gene. The numbers indicate the approximate distances in kbp from the right hand EcoRi 
45 sites. 

Figure 2 : Partial restriction map of pGW1800. The plasmid contains DNA of the vector pEMBL18 and 
the 4.1 Kbp Xbal-EcoRI Insert obtained from phage \E which comprises the A. niger N400 PGII gene. Ap 
is the amplcillin resistance gene and the arrows indicate the A. niger derived "DNX 

Figure 3 : Partial restriction map of pGW1900. It Is composed of the pUC9 vector and a 8.6 kbp BamHI 
50 fragment of phage PGI-X7. 

Figure 4 ; Cloning strategy for plasmlds pGll-lFN AM119 and pGMss-IFN AM1 19. 

Figure 5 : Stepwise preparation of DNA inserts DNA 5 and DNA 6 for the construction of pGII-IFN AM1 19 
and pGllss-!FN AM1 19, respectively, with the polymerase chain reaction (PGR) method. 
Figure 6 : Partial restriction map of pGW1756. pGW1756 is the 5.5 kbp Xhol-Bglll fragment which carries 
55 the A. niger NW756 polygalacturonase II gene inserted in -he vector pEMBLIB. Restriction sites in the 
vector are not shown. The former Xhol and Bglll sites of the fragment are also shown, but these have 
been destroyed by the insertion in the Sail and BamHI sites, respectively, of the vector. 
Figure 7 : Partial restriction map of pGW1910. pGW1910 is the 7.8 kbp Bglll fragment of lambda phage 
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XPG-C20 inserted in the BamHI site of the vector pUC9. The approxinnate location of the A. niger N400 
gene encoding PGC ( pga C) gene is indicated. 
The following examples serve to illustrate the invention, however, are in no way intended to restrict it. 
The abbreviations have the following nneanings: 
5 Amp ampieillin 

bis Tris bis (2-hydroxyethyl) imino-tris (hydroxymethyl) methane 
BSA bovine serum albumin 
DTT 1 ,4-dithiothreitoi 

EDTA ethylenediamine tetra acetic acid, disodium salt 
10 !PTG isopropyl-j8-D-thiogaiactopyranoside 

kbp kilo base pairs 

PEG polyethylene glycol 

PTH phenylthiohydantoin 

SDS sodium dodecyl sulfate 
75 Tet, tetracycline 

TFA trifluoro acetic acid 

TP tryptic peptide 

Tris tris (hydroxymethyl) 'aminomethane 
X-gal 5-bromo-4-chioro-3 indolyl-jS-gaiactoside 

20 

Buffers, media, reagents 



SM 100 mM NaCI, 8.1 mM MgSO^, 50 mM Tris-HCI pH 7.5.0.01 % gelatin 
25 LB 1 % trypticase peptone (BBL). 0,5 % yeast extract (BBL), 1 % NaCl and 0.5 mM Tris-HCI pH 7.5 
LM 1 % trypticase peptone (BBL), 0.5 % yeast extract (BBL), 10 mM NaC! and 10 mM MgCb 
PBS 0.37 g NaHaPO*. 2.7 g Na2HP04. 3.5 g NaCI per litre H2O 
SSC 0.15 M NaCI, 0.015 M tri-sodium citrate 

PSB 10 mM Tris-HCI. pH 7.6, 100 mM NaCI, 10 mM MgCi2, 0.05% (w/v) gelatine. 

30 TE lOmM Tris-HC! pH 8.0, 0.1 mM EDTA pH 8.0 

minimal medium 1 litre contains 1.5 g KHaPOi, 0.5 g KC!, 0.5 g MgSOiVHaO. 0.9 mg 2nSOi*7H20, 0.2 
mg MnCl2*4H20. 0.06 mg CoCl2-6H20. 0.06 mg CuSOi'SHaO, 0.29 mg CaCl2*62H20, 0.2 mg 
FeS04-*7H20, nitrogen and carbon sources as specified in the text or 6 g NaNOs and 10 g glucose per litre 
if these sources are not explicitly mentioned, adjusted to pH 6.0 with NaOH 

35 complete medium minimal medium with 6 g NaNOa and lOg glucose per litre, plus per litre 2 g trypicase 
peptone (BBL), 1 g casaminoacids (Difco), 1 g yeast extract (BBL). 0.5 g ribonucleic acid sodium salt from 
yeast (ICN. Cleveland, USA), 2 ml vitamin solution, adjusted to pH 6.0 with NaOH 

vitamin solution per 100 ml 10 mg thianriine, 100 mg riboflavin, 10 mg panthotenic acid, 2 mg biotin, 10 mg 

p-aminobenzoic acid, 100 mg nicotinamide, 50 mg pyridoxin-HCi 
40 TBE 1 litre contains 4 ml of a 0.5 M EDTA pH 8.0 solution, 10.8 g Tris and 5.5 g H3BO3 

phenol phenol treated as described by Maniatis et al . {p438; ref. 6) 

sample buffer 10 % (v/v) glycerol, 100 mM EDTA"pH 8.0 and 0.01 % bromophenol blue 

RNase A RNase A treated as described by Maniatis et al . (p451; ref. 6) The following strains are used: 

A. niger N400 wild type 
45 A^ niger N402 csp A 

A^ niger NW756 high pectinase producing strain 

AT niger N756 high pectinase producing strain 

A. niger An8 DSM 3917, uridine auxotrophic mutant of the pectinase complex highly producing strain A. 
niger N756 ~ 
50 A.niger N593 csp A, pyr A 

E. coli NM539"7net BT^p E, hsd M*. hsd R~, sup F, (P2cox3) 

E. coli LE392 PThsd"R514 (rir\ mk'yTsup E4~sup F58. lac Y1, or lac 1ZY)6. gal K2, gal T22. metBI, trp 
R55, X- ^ ^ 

E. coli DH5aF' f', end A1, hsd R17, (rk", mk*). sup E44. thi -1. rec Al, gyr A, rel A1,)8O0lac Z Ml 5, A(lac 
55 ZYA-arg F)U169, X" ~ ^ ~ 

E. con JM109 end A1. rec A1, gyr A96, thi , hsd R17 (rk", mk*). rel A1, sup E44, X", (lac-pro AS), [F', tra 
D36. pro AB, lac IqZ M15] ~ 

E.coli JM110 rps l-. th£ . . £[1 • 9^' • ^ > ■ -^^P ^^4, (lac-pro AB). [F', tra 
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D36. pro AB. lac IqZ M15] 

The following vectors are used 



5 pGW613 



This plasmid has been described by Goosen et al. (ref. 13). 



70 Ml 3nnp phage 

The M13mp 18 and M13nnp19 vectors (Norrander et al ref. 24) are derivatives of the single-stranded 
DNA bacteriophage MIS and are designated to facilitate DNA sequencing by allowing cloning of DNA 
fragments at a versatile polyiinker site and the cloning of the sanne restriction fragment in both possible 

15 orientations. Sequences cloned into these vectors can readily be used as teinplates for sequencing 
reactions or the production of single-stranded probes using standard ollgodeoxyribonucleotide primer and 
the Klenow fragment of the E. coli DNA polymerase I. The vector DNA is carrying the E. coli lacoperon 
promoter and the genetic information of the first 145 amino acids of ;S-galactosidase. The polyiinker 
sequences containing multiple restriction sites are inserted into the lacZ sequence. The polyiinker retains 

20 the lacZ reading frame and the vector gives allelic complementation of a LacZa host strain, yielding blue 
plaques on plates containing IPTG and X-gaL Recombinant phages containing inserts that destroy the 
reading frame or otherwise interfere with expression of the lacZa peptide are revealed as colorless plaques. 

25 pGW635 

The plasmid is a shorter version of pGW613. It also contains the pyr A gene. It is described by Goosen 
et al. (ref. 42). 

30 

EMBL4 

EMBL4 is a lambda replacement vector with a cloning capacity of 9-23 kbp (Frischauf et al ., ref. 9). It 
contains a multiple cloning region between the lambda arms and the non-essential stuffer region. This 
35 allows multiple restriction enzyme digestions to be performed in a manner such that religation of the stuffer 
to the vector arms is reduced as the foreign DNA of interest is inserted. The vector also makes use' of the 
phenotype Spi to provide a direct selection for recombinants (Zissler et al ref. 21). 



40 pEMBL18 and pEMBL19 

These plasmids have been described by Dente et al. (refs. 10,22,23). 



45 Example 1 : Isolation and characterization of polygalacturonase 



Example 1.1: Purification of polygalacturonases I, M. HIA. lilb. and IV 

50 The enzyme activity is determined in the enzyme fractions obtained hereinafter as described before 
(Rozie et al ref. 2) using a modified ferricyanide test (Robyt et al ., ref. 3). 

Polygalacturonases I, II, MIA, NIB, and IV are purified from Rapidase®, a commercially available mixture 
of pectinolytic enzymes obtained from Aspergillus niger (Gist-brocades, Seclin, France). 50 g of the crude 
enzyme powder (Lot-No. K2B 078) is dissolved in 190 ml of a 20 mM sodium acetate buffer pH 3.6. 
55 centrifuged for 10 min. at 25,000 g to remove solids and desalted on a Sephadex G-50 column (5 x 90 cm) 
equilibrated in the same buffer. 

The first step in the purification procedure is an affinity chromatography step using crosslinked alginate 
V7hich is prepared as described in Rombouts et al . (ref. 1). The cross-linked alginate which has a bed 
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volume of 5.2 m/g. is also equilibrated with 20 mM sodium acetate buffer pH 3.6 (column dimensions 2.5 x 
30 cm). The desalted enzyme is loaded onto this column and then washed by applying a 100 mM sodium 
acetate buffer pH 4.2 and pH 5.6, respectively (400 ml of each). Whereas PGI and PGII adsorb to the 
alginate matrix, PGs IIIA, lllb. and IV do not. 

5 

Example 1.1.1: Purification of PGI and PGII 

The adsorbed PGI and PGII proteins from the crosslinked alginate column are eluted by a linear sodium 
10 chloride gradient (0-1 M) in a 100 mM sodium acetate buffer at pH 5.6 which is a slight modification of the 
procedure used before (Rombouts et aL , ref. 1). PGI coelutes with PGII at about 0.5 M NaCI. 

The enzyme fraction comprising PGI and PGM is dialyzed against 20 mM sodium acetate buffer pH 5.7 
and applied to a DEAE-Sephadex A-50 column (2.5 x 30 cm) equilibrated in the same buffer. The enzymes 
are eluted using a linear NaCI gradient (0-0.6 M). PGI elutes at about 0,3 M NaCI, PGII at about 0.2 M. The 
15 enzyme fractions containing PGI and PGII are collected separately, dialyzed against 20 mM bis Tris HCI 
buffer pH 5.8 and chromatographed on a DEAE-Sepharose Fast Flow colum equilibrated in the same buffer. 
Upon applying a NaCI gradient, PGII is eluted at approximately 120 mM sodium chloride, PGI at about 250 
mM. Active enzyme fractions containing PGI or PGil are concentrated after dialysis against 20 mM sodium 
acetate buffer pH 5.7 to a final volume of 25 ml on a small DEAE-Sepharose Fast Flow column (1.6 x 10 
20 cm) by efuting with 1 M sodium chloride in this buffer. Final purification of each of the tNO enzymes PGI 
and PGII is achieved by gel permeation chromatography on Sephacryl S200 (1.6 x 90 cm) in 0.1 M sodium 
acetate pH 4.2. 

PGII shows a single band upon SDS-polyacrylamide gel electrophoresis and has an apparent molecular 
mass of 38 kDa, The specific activity is 2760 U/mg of protein determined in 0.075 M sodium acetate buffer 
25 pH 4,2. Upon iso-electric focusing the enzyme shows microheterogeneity. Besides a pronounced major 
component at approximately pH 5.2, a large number of minor bands is observed in the pH range between 
4.6 and 5.9 

PGI also shows a single band upon SDS-polyacrylamide gel electrophoresis. This enzyme has an 
apparent molecular mass of 55 kDa. The specific activity is 550 U/mg of protein determined in 0.075 M 
30 sodium acetate buffer pH 4.2. Upon iso-electric focusing the enzyme also shows micro-heterogeneity. 
Several bands are observed in the pH-range of pH3.2-3.5. 



Example 1.1. 2.: Purification of PGtllA. PGIIIB and IV 

35 

The effluent of the crosslinked alginate column (Example 1.1) containing the unbound PGs is adjusted 
to pH 5.7 with 1 M sodium hydroxide and is loaded on a DEAE-Sephadex A-50 column (2.5 x 30 cm), 
equilibrated in 0.02 M sodium acetate buffer pH 5.7. The enzymes are eluted from the DEAE-Sephadex 
column with a linear sodium chloride gradient (0-1 M). PGIV elutes at 0.38 M sodium chloride, PGIIIA and 
40 PGlllB coelute at 0.5M. 

5 ml of each of the two enzyme fractions are desalted on a Sephacryl S-200 column (1.6 x 90 cm) in 
0.1 M sodium acetate buffer pH 4.2. The active fractions are pooled, diluted five fold with distilled water and 
loaded on a MONO Q column (Pharmacia Fine Chemicals, Uppsala, Sweden) equilibrated in 0,02 M bis- 
Tris/HCi buffer pH 5.8. Upon applying a 20 ml linear sodium chloride gradient (0.1-0.4 M), PGiV elutes at 
45 0.32 M sodium chloride, whereas PGs lilA and IliB coelute at 0.4 M sodium chloride. 

Both enzyme fractions are separately re-chromatographed on a MONO Q column under the conditions 
described above. Fractions containing PG activity are pooled, dialyzed against 0.025 M piperazine/HCI 
buffer pH 6.0 and loaded on a MONO P column (Pharmacia Fine Chemicals, Uppsala, Sweden) equilibrated 
in the same buffer. The column is eluted with 10 % (v/v) poiy buffer 74 (Pharmacia Fine Chemicals, 
50 Uppsala, Sweden) pH 3.0 at a flow rate of 0.75 ml/min. 

Final purification is achieved by gel permeation chromatography on a TSK G 3000 SW column (0.75 x 
30 cm) (LKB, Bromna, Sweden) equilibrated in 0.05 M sodium phosphate buffer pH 6.2 at a flow rate of 0.5 
ml/min. 

The column is calibrated with the following protein standards; bovine serum albumin (68 kDa); egg 
55 albumin (45 kDa) and chymotrypsinogen A (25 kDa). 

An apparent molecular mass of 53 kDa is determined for PGIV by gel permeation chromatography. 
PGIV shows a single band upon SDS-polyacrylamide gel electrophoresis and has an apparent molecular 
mass of 59 kDa determined by electrophoresis on a 15 % polyacrylamide gel. The specific activity is 780 
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U/mg of protein determined in 0.075 M sodium acetate buffer pH 4.2. Upon iso-electric focusing the 
enzyme shows a single sharp band at pH 3.7. 

Gel permeation chromatography of the active fractions containing PGIilA and IlIB on the calibrated TSK 
Q 3000 SW column, under the same conditions as described for PQIV. results in separation of the two 
5 enzymes with apparent molecular masses of 32 and 52 kDa respectively. Upon SDS-polyacrylamide gel 
electrophoresis (15 % polyacrylamide gel), however, both enzymes show a single band with an identical 
molecular mass of 57 kDa. PGIIIA and IIIB have a specific activity of 150 and 250 U/mg protein, 
respectively, determined in 0.075 M sodium acetate buffer pH 4.2. Upon iso-electric focusing both enzymes 
show a single sharp band at a pH-value of 3.3. 



Example 1.2.: A comparison of the properties of the polygalacturonases and their immunological relation- 
ship 

75 Starting with 50 g Rapidase (Gist-brocades, Seclin, France) five endopolygalacturonases are purified as 

described in Example 1.1. The properties of these purified enzymes are summarized in Table II. 

Table II. 



Physico-chemical properties and 


pH-optima of five A. niger 




polygalacturonases. 


Enzyme 


Mr^ 


I.E.P.*^ 


pH-optlmum 








(activity) 


PGI 


55K 


3.2-3.5 


4.9 


PGll 


38K 


4.6-5.9 


4.8 


PGIIIA 


57K 


3.3 


4.3 


PGIIIB 


57K 


3.3 


4.5 


PGIV 


59K 


3.7 


4.8 


Notes: 









^ Extrapolated value determined by SDS-geleiectrophoresis. 
^ Isoelectric point, determined by thin-layer isoelectric focusing. 



Ail purified enzymes have an apparent molecular mass on a 15 % SDS-polyacrylamide gel in the range 
of 55-59 kDa, except PGll which appears to be a much smaller protein (38 kDa). The special nature of PGll 
40 Is emphasized- by its relatively high iso-eiectric point in contrast to the low iso-electric point values 
determined for the other polygalacturonases. The pH optimum of all polygalacturonases is in the same 
range (4.3-4.9). 

Specific antibodies against purified PGI and PGll are raised in male New Zealand white rabbits 
according to the immunization scheme described by Uitzetter (ref. 36). 

45 Crossreactivity between the two antisera and the five purified polygalacturonases is monitored as 
outlined below. 0.4-1.0 ug purified protein is loaded on a 10 % SDS-polyacrylamide gel and after 
electrophoresis transferred from the gel to a nitrocellulose membrane using the method described by 
Bowen et al . (ref. 37). Incubation of the nitrocellulose blot with the specific antisera followed by staining 
with peroxi"dase labelled goat anti-rabbit IgG is done according to Uitzetter (ref. 36). Upon incubation with 

50 the PGI specific antiserum a strong signal is found for PGI and the PGs IIIA, 1118 and IV. PGll gives a 
slightly weaker signal with this antibody. Incubation of the blot with PGll specific antiserum results in a high 
Intensity band for PGll but only weak bands for the other four polygalacturonases. 

55 ' Example 1.3: Amino acid sequence determination of cyanogen bromide fragments of polygalacturonase II 



Approximately 100 ug of PGll is dissolved in 150 ul 70 % formic acid and solid cyanogen bromide is 
added in a hundred fold molar excess to methionine (of which 4 residues occur per enzyme molecule). The 
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reaction mixture is stirred continuously for 24 h at room temperature in a closed reaction vial. After 24 h 
water is added and the preparation is dried by flushing with N2. This step is repeated to remove formic 
acid. 

After electrophoresis on 15 % SDS-poIyacrylamide gels approximately 10 individual bands can be 
5 stained by Coomassie Brilliant Blue. They represent incompletely and completely cleaved fragments. 
Besides the intact protein which has a molecular mass of 38 kDa, the following bands are observed: 33, 30, 
27, 19, 17, 12, 10. 7.5, 6 and 6 kDa. By taking samples at regular intervals during the 24 h reaction period 
and analyzing these by SDS-polyacrylamide gel electrophoresis, the order in which partial and final reaction 
products appear as well as their position with respect to each other has been determined. The 17 kDa 
10 fragment is an internal fragment whereas the 5 kDa fragment is located either at the C-terminus or N- 
terminus. The fragments of the 24 h treatment are then blotted onto Immobilon-P, which is a polyvinylidene 
difluoride membrane (Millipore), according to the procedure described by Matsudaira et al. (ref. 4). Three 
fragments are used for gas-phase sequencing viz a 17 kDa internal fragment and two smaller fragments (5 
and 6 kDa). 

15 Amino acid sequences are determined with an Applied Biosystems mode! 470 A protein sequencer, 
connected on-line with an Applied Biosystems 120 A PTH analyzer. Membrane fragments containing 0.5-1 
nmole of a particular peptide are washed, loaded onto the gas-phase sequencer and subjected to sequence 
analysis according to the programme described by Amons (ref. 5). 
For the 5 kDa fragment the following sequence is obtained: 

20. 

Position : 1 5 

Amino acid: asp - ser - X - thr - phe - thr - thr - ala - ala - 

Position: 10 12 13 

Amino acid: ala - ala - iys (ala) - (ala) 



30 



35 



40 



45 



At position 3 (X) no amino acid is detected. Since the sequencing programme used only detects cysteine 
residues if the protein is S-pyridyiethylated before, it is likely that position 3 may be a cysteine (Amons, ref. 
5). In position 12 and 13 traces of ala are found as is indicated by brackets. While the trace in position 12 
clearly results from an incomplete reaction in the previous step the iow signal of ala in position 13 
represents the amino acid ala in position 13 of the peptide. 
The sequence for the 17 kDa fragment is 

position: 1 5 

amino acid: ala - phe - ser - val - gin - ala - asn - asp - 
position: 10 

amino acid: ile - thr (ile) - phe (thr) 

The traces of isoleucine and threonine observed in the last two steps as is indicated by the brackets 
clearly result from incomplete reactions in the previous steps. 



Example 1.4.: Amino acid sequence determination of the N-terminai part of polygalacturonase I 



100 ug of PGI purified according to Example 1.1.1 is dialyzed three times against II Millipore filtered 
water and lyophllized. The amino acid sequence is determined as described in Example 1.3. The following 
N-terminal amino acid sequence has been determined for the enzyme 

Position : 1 5 9 

amino acid: ala - ser - thr - X - thr - phe - thr - ser - ala 
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Thy cysteine residues in the protein have not been modified and are thus not detected. Cysteine is likely to 
occur at position 4 (X) of the sequence. The N-terminal annino acid sequence of PGI shows honnology with 
the annino acid sequence of the 5 kDa cyanogen bronnide fragnnent of PGII which has been isolated from 
Rapidase (Example 1.3) and with the N-terminal amino acid sequence of PGII isolated from the A. niger 
5 transformant N593/pGW 1800-27 (Example 6.3). This homology is outlined in the following scheme, in 
which it is assumed that cysteine occurs at position 4 and 3 of the RGI and PGII sequences, respectively: 

PGI ala - ser - thr - cys - thr - phe - thr - ser - ala 
10 PGII asp - ser - cys - thr - phe - thr - thr - ala 



The open space between the serine residue at position 2 and the assumed cysteine residue at position 3 in 
the polygalacturonase II sequence is introduced in order to align both sequences. 

The serine residue at position 8 in the PGI sequence is not present at the corresponding position in the 
PGII sequence (position 7). but in the latter case a similar type amino acid, i.e. a small hydroxy! amino acid 
(threonine), is present. 

The N-terminal amino acid sequences of PGI and PGII thus show homology, but are not identical. The 
differences are such that these cannot be the result of partial modification of a product of a single gene, 
such as for instance partial proteolytic cleavage. It is concluded that PGI and PGII are encoded by different 
genes of the same family. 



25 



30 



35 



55 



Example 1 .5: Amino acid sequence determination of cyanogen bromide fragments of polygalacturonase I. 

Approximately 100 iig of PG I is dissolved in 150 y.1 70% formic acid and solid cyanogen bromide is 
added to prepare fragments of the protein as described in Example 1.3. 4 methionine residues are present 
per enzyme molecule. After electrophoresis on 15% SDS-po!yacrylamide gels and staining with Coomassie 
Brilliant Blue, major bands are observed having a molecular mass of 39.5, 35, 21, 19.5 and 5.5 kDa. 

The 21 and 5.5 kDa fragments are sequenced using the same equipment and procedures as described 
in Example 1.3. 

The sequence for the 21 kDa fragment is 

Position 1 

Amino acid ala - ser - thr - X - thr - phe - 



Position 10 

Amino acid thr - ser - ala - ser - glu 

Cysteine is likely to occur at position 4 (X) of the sequence similar to what has been described in Examples 
1.3 and 1.4. The sequence of the 21 kDa fragment is identical with the N-terminal amino acid sequence of 
45 the intact PG I protein described in Example 1.4. 

The 5.5 kDa fragment has the following sequence 

Position 1 5. 

50 Amino acid ala - asp - giy - ala - val - ile 

Position 10 
Amino acid asp - gly - asp - gly - ser 



It is concluded from these sequence data that the 5.5 kDa fragment is not corresponding with the N- 
terminus of the protein. 
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20 



25 



Example 1.6: Amino acid sequence determination of a tryptic peptide 



PGIl is partially denatured by dialysis against 0.5 % (v/v) formic acid and subsequently freeze-dried. 
The enzyme (1 mg) is suspended in 1 ml of a 0.2 M N-ethyl morpholino-acetate buffer pH 8.0 and then 

5 incubated at 37* C with trypsin (Sigma) which is added in a molar ratio of 1:50 with respect to PGIl. After 24 
h the digestion is stopped by the addition of acetic acid (30 % final concentration). The digest is freeze- 
dried. Samples of the digest are dissolved In 0.1 % trifluoroacetic acid containing 2.5 mM dithiothreitol and 
kept at 37* C for at least 1 h. Tryptic peptides (100 ul samples containing 100-200 ug of protein) are 
separated using a Spectra Physics SP 8000 HPLC equipped with a Nucleosil C-18 column (4.6 x 150 mm) 

10 (Supeico) and a Vydac guard column (Chrompack). A a column temperature of 25 'C, a flow rate of 2 
ml/min and a linear gradient is applied which changes in 50 min from 100% solvent A (0.1 % TFA in water) 
to 50% solvent A + 50 % solvent B (0.1 TFA in acetonitril). Peptides are detected by UV absorption at 214 
nm. One of the peptides which is released late during digestion appears well separated from other peaks in 
the chromatogram at approx. 21 % acetonitril. This peptide is dried by flushing with dry air at 25 'C and 

75 sequenced. 

The amino acid sequence is determined according to the procedure described in Example 1.4. The 
sequence is: 



Position: 1 5 

Amino acid: thr - ile - ser - giy - ala - thr - giy - ser - 

Position: 10 14 

Amino acid: val - ser - glu - ile - thr - tyr 



Example 2: Construction of a genomic library of Aspergillus niger 



Example 2.1: isolation of high molecular weight DNA from A. niger N400 



Conidiospores of Aspergillus niger strain N400 are inoculated in 200 ml minimal medium in a final spore 
density of 10^ spores/ml and shaken in 1 I Erienmeyers for 24 h at 28* C at 300 rpm using a New 
Brunswick rotary shaker. The mycelium is harvested by filtration using a Buchner funnel with Myracloth, 
washed with cold sterile saline, frozen in liquid nitrogen and either stored at -60 'C or used directly. The 
method used for isolation of DNA to prepare the genomic library is based on the procedure described by 
Yelton et a!, (ref. 18). 

For library construction, 10 g mycelium is ground in liquid nitrogen in 1 g portions in a Braun micro- 
dismembrator. The ground mycelium is transferred to a 1 1 sterile erienmeyer. containing 200 ml extraction 
buffer (50 mM EDTA pH 8.5, 0.2 % SDS) and 200 u! diethyipyrocarbonate. The mixture is slowly warmed 
up to room temperature and then heated for 20 min to 68* C with occasional shaking. The suspension is 
cooled to room temperature and centrifuged for 15 min at 12,000 x g. 1/16 volume of an 8 M potassium 
acetate solution pH 4.2 is added to the supernatant and the mixture is left on ice for 1 h. The precipitate is 
removed by centrifugation (20 min.; 16,000 x g; 4'C). The nucleic acids are precipitated from the 
supernatant by an incubation with 0.6 volume of isopropanol on ice for 15 min. The precipitated nucleic acid 
is collected by centrifugation (10 min.; 6,000 x g; 4'C), washed with 70 % ethanol and briefly dried. The 
pellet is suspended in 10 ml TE containing 20 ug/ml Rnase A, (Boehringer, Mannheim) and incubated for 
15 min at 37*0. The DNA is treated with nuclease free pronase (1 mg/ml final concentration) (Kochlight, 
Coinbrook) for 1 h at 37* C. The pronase stock solution in TE buffer contains 20 mg/ml of enzyme which is 
pre-incubated for 1 h at 37* C to digest nucleases. 

8.5 g CsCl is dissolved in 9 ml of the DNA solution obtained. 0.2 ml 10 mg/ml ethidium-bromide is 
added and this solution is either, centrifuged in a Beckman SW41 rotor for 60 h at 33,000 rpm. or in a 
Beckman 50 Ti rotor for 40 h at 45,000 rpm. The DNA band is collected and the ethidiumbromide is 
removed by multiple extraction with isopropanol equilibrated with a satured solution of NaCI in water. 5 
volumes of TE are added and the DNA solution is sequentially treated with TE saturated phenol, 
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phenol/chloroform/isoamylalcohol 25:24:1 and chioroform/isoamyialcohol 24:1. The DNA is precipitated by 
addition of 0.1 volume of 3 M sodiunn acetate pH 5.2, 2.5 volumes of ethanol and an overnight incubation at 
-20'C. The precipitate is collected by centrifugation (1 h, 30,000 x-g: 4"C). washed with 70 % ethanol, 
dried and dissolved in 400 u\ TE. 

5 

Example 2.2: Partial digestion of A. niger N4Q0 DNA with Mbol and isolation of fragments 



To test for the Mbol concentration which gives the largest amount of DNA fragments between 13.6 and 

70 23 kbp, 1 HQ portions of A, niger N400 DNA are digested in the appropriate buffer recommended by the 
supplier with decreasing amounts of Mbol (0.5-0.001 U) for 1 h at 37* C in a volume of 10 ul. The reaction 
is stopped by addition of 1 ii\ 0.25 M EDTA. and the samples are loaded on a 0.6 % agarose gel in TBE 
buffer, containing 1 ug/m! ethidiumbromide. Convenient size markers are a mixture of lambda DNA and 
lambda DNA digested with Bgll!, which gives bands of 49, 22.8. 13.6. 9.8, 2.3, kbp and a nonvisible 

75 fragment of 0.45 kbp. The Mbol concentration required to give a high yield of the desired 13.6-23 kbp 
fragments is about 0.02 U/ug DNA. Accordingly, 200 ug of DNA In a total volume of 2 mi are digested, and 
are divided into 20 equal aliquots immediately after addition of the enzyme. After 1 h at 37' C the digests 
are placed on ice. After checking a 1 ul sample for proper digestion by running on a gel. EDTA is added to 
a final concentration of 25 mM. the enzyme is heat-inactivated at 65' C for 10 min, samples are pooled and 

20 the DNA is precipitated, washed, dried and dissolved in 400 ill TE. 

The fragmented DNA is separated on a 0.4 % preparative agarose gel (center well 120 x 1.5 mm). 
Lambda DNA digested with Bglil is used as marker to determine the size of the partially digested DNA 
fragments upon electrophoresis at 4°C and 40 V (3 V/cm). The gel region containing fragments of the 
correct size is cut out of the gel and the DNA is electroeluted from the gel in a sterile dialysis tube in 2 ml 

25 TBE for 2-3 h at 100 V. The current is reversed for 30 s, and the buffer containing the DNA is collected. 
The fragments are then concentrated by ethanol precipitation and dissolved in 100 ul TE. 



Example 2.3: Preparation of vector DNA and cloning of high molecular weight DNA fragments of A. niger 
30 N400 into EMBL4 



The genomic library of A. niger strain N400 is constructed in the lambda vector EMBL4. The vector, 
which has a cloning capacity" of 9-23 kbp, is described by Frischauf et ai. (ref. 9) and Karn et al. (ref. 19) 
and has been purchased from Promega Biotech. Inc. To avoid double Inserts originating from different parts 

35 of the genome, a minima! fragment length of 13.6 kbp is used for cloning. 

10 ug lambda EMBL4 DNA is digested to completion with 50 units of BamHI in the buffer recom- 
mended by the supplier in a volume of 100 ul for 2 h at 37'C. The enzyme is inactivated for 10 min at 
65' C. The NaCI concentration is raised to 150 mM and 50 units of sail are added and incubation at 37* C 
continues for another 2 h. After addition of EDTA to 25 mM and inactivation of the enzyme (heating for 10 

40 min at 65 'C) the solution is extracted with equal volumes of phenol (TE saturated), 
phenol/chloroform/isoamylalcohol 25:24:1, and chioroform/isoamyialcohol (24:1). To eliminate the small 
BamHI/Sall polylinker fragments, the DNA is precipitated with 0.6 volume of isopropanol after the addition of 
0.1 vol. 3 M sodium acetate pH 5.2. After 15 min on ice and 15 min centrifugation at 12,000 x g at 4* C, the 
precipitate is thoroughly washed with 70 % ethanol, dried and dissolved in 40 u! TE. 

45 

Example 2.4: Ligation and in vitro packaging of genomic A. niger N400 DNA fragments 



It is essential that the cos sites of the vector prepared according to example 2.3 are annealed prior to 
50 the ligation reaction. The vector in 100 mM Tris-HCl pH 7.5 and 10 mM MgCb is heated for 10 min at 65* C 
and then annealed for 1 h at 42* C. From test ligations a ratio of vector to fragments of approximately 1:1 
(by weight) is found to give most recombinants. Ligation took place in 50 mM Tris HCl pH 7.5, 10 mM 
MgCl2, 10 mM DTT and 1 mM ATP, using 9.5 ug of vector and 10 ug of DNA fragments in a total volume 
of 100 ul. DNA ligase (BRL) is added at a concentration of 0.5 U/ug DNA and the ligation mixture is 
55 incubated overnight at 14* C. To test for ligation a sample of the ligated DNA is run on an agarose gel. Also, 
as a control 0.5 ug. of vector is ligated without the addition of fragments in a 5 ul volume. 

The large volume ligation mixture is concentrated by ethanol precipitation and dissolved in 20 ul TE 
prior to in vitro packaging. In vitro packaging is done with Promega Packagene extracts according to the 
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instructions of the manufacturer using 10 ul portions to package 1 ug of DNA. 1 ug of the high molecular 
weight control phage lannbda cl857 Sam 7, supplied with the extracts, is separately packaged as a control. 
After packaging. 500 ul of phage solution buffer (PSB) and 5 ul of chloroform are added. The recombinant 
phage stocks can be stored at 4*'c. The library obtained is constructed from two separate ligation 
experiments. 



Example 2.5: Titration and amplification of the A, niger strain N400 genomic library 



Cells of E. coli NM539 are grown on LB medium containing 0.2 % maltose, 10 mM MgSO* and 1 mM 
CaClg to an optical density (600 nm) of 1.0. 0.2 ml aliquots of this culture are added to 0.1 ml of an 
appropriate phage dilution in PSB.^ After adsorption of the phages for 20 min at 37* C. 3 ml 0.6 % LB top- 
agar having a temperature of 45 'C is added, the mixture is plated on LB agar plates and these are 
incubated overnight at 37* C. The number of plaque forming units (pfu) per ml phage suspension are 
12x10^ and 4,2x10^ pfu/ml for the two phage stocks prepared according to example 1.4. After subtracting 
the background which is calculated from the control ligations without fragments (17 % and 40 % 
respectively) the absolute number of recombinants is 6x1 0^. The DNA contained in the recombinants is 
equivalent to more than 200 of the Aspergillus niger genomes. 

To amplify the library, 80 ul aliquots of both phage stocks are used to infect E, coli NM539 cells which 
are plated in LB top-agarose on LB agar plates and then incubated overnight if 37" C. The phages are 
eluted from the agarose by gently shaking the plates with 5 ml PSB per plate for 1 h at room temperature. 
The PSB is collected, centrlfuged (10 min at 6000 xg) to remove bacteria and chloroform is added (0.5 % 
final concentration). Both phage stocks, which are amplified approximately to the same extent, are then 
mixed (40 ml stock), titrated (8x1 0^ pfu/ml) and stored at 4° C. 



Example 3: Screening the genomic library of A. niger for nucleic acids related to polygalacturonases 



Example 3.1: Synthesis of oligonucleotide mixtures coding for the 17 kOa cyanogen bromide fragment of 
polygalacturonase II " 

The oligonucleotides for screening the genomic library of Aspergillus niger prepared according to 
Example 2. are synthesized by using the phosphoramidite method (M.H. Carulhers, ref. 8) with an Applied 
Blosystem (model 380 B) oligonucleotide synthesizer. The 17 kDa cyanogen bromide fragment described in 
Example 1.2 is an internally located peptide. Therefore the amino acid sequence determined in Example 1.4 
can be extended at the N-terminus with a methionine residue. The oligonucleotides synthesized are 
complex mixtures as the degeneracy of the genetic code is taken into account. For technical reasons it is 
necessary to reduce the number of nucleotides In the mixture. To exclude those oligonucleotides with the 
wrong combination TG and AC at position 10 and 11 and to limit the size of the mixture, two separate 
mixtures of oligonucleotides corresponding to the sequence in the coding strand are synthesized taking Into 
account the degeneracy of the genetic code. The number of oligonucleotides required is further reduced by 
introducing tnosine as the wobble base at four different positions of the synthesized 29-mers and thus each 
mixture consists of 16 different oligonucleotides (Ohtsuka et ai. , ref. 16). The two mixtures which are 
named HR6195 and HR6196 represent DNA encoding the "following amino acid sequence and have the 
following composition: 
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70 



amino acid sequence: met ala phe ser val gin ala 

HR6195: 5' (d) ATG GCI TTRi TCI GTI CAR2 GCI 

HR6196: 5' (d) ATG GCI TTRi AGI GTI CAR2 GCI 

amino acid sequence: asn asp ile 
HR6195: AARi GARi AT 3' 

HR6196: AAR^ GARi AT 3' 

wherein I is inosine, Ri is T or C, R2 is A or G. 

Example 3.2: Synthesis of an oligonucleotide mixture coding for the 5.5 kDa cyanogen bromide fragment of 
polygalacturonase I. ~ 

20 . The 5.5 kDa cyanogen bromide fragment described in Example 1.5 is an internally located peptide 
which is prepared by CNBr cleavage. Therefore the amino acid sequence thereof can be extended at the N- 
terminus with a methionine residue. The number of oligonucleotides required is reduced by introducing 
inosine as the wobble base at five different positions of the 32-mers which are synthesized. 
This limits the mixture to 24 different oligonucleotides. The mixture, named HR 6298, represents DNA 

25 encoding the foilowing amino acid sequence and has the following composition: 

Amino acid sequence: met ala asp 

HR62 98 5' (d) ATG GCI GARi 



75 



30 



35 



40 



gly ala val ile asp gly asp gly 
GGI GCI GTI ATR2 GARi GGI GARi GG 3' 

wherein ( is inosine, Ri is T or C and R2 is T, 0 or A. 
Example 3.3: -^P-labeling of oligonucleotides 



The lyophilized oligonucleotide mixtures HR6195, HR6196 and HR6298 are dissolved in distilled water 
at concentrations of 29 uM. Samples of these solutions are diluted tenfold to make the reaction mixture. 
The reaction mixture is composed of either 20 pmoles oligonucleotides mixture HR6195 together with 20 
pmofes HR6196 or of 40 pmoles HR6293 alone, 34 pmoles of [y-^^PJ-ATP (NEN. 6000 Ci/mmoi). and 30 
^5 units T4 polynucleotide kinase (BRL) in 50 ul kinase buffer, which is 50 mM Tris-HCI (pH 7.6), 10 mM 
MgCl2. 5 mM DTT, 0.1 mM EDTA and 0.1 mM spermidine (Maniatis, et al. , ref. 6, page 122). The 
incubation is carried out at 37* 0 for 30 min. The reaction is stopped by the~a(3dition of 4 ul of 0.5 M EDTA 
(pH 8.0). The reaction mixtures are used without purification for screening the genomic library (Example 3.4) 
or probing Southern blots (Example 3.6). 

50 

Example 3.4: Screening of the A. niger N400 library 

Part of the genomic library of Aspergillus niger strain N400 described above (Example 2) is diluted in 
55 SM and 0.1 ml portions each containing about 2000 pfu are plated. Host cells are prepared by inoculating 
50 mi of LB-medium supplemented with 0.2 % maltose with 0.5 ml of an overnight culture of E. coli NM539. 
in LB-medium, shaking for 4 h at 250 rpm on a Gallenkamp orbital shaker at 37* C, followed bVtTie"addition 
of 0.5 ml 1 M MgS04 and of 0.5 ml 0.5 M CaCl2. 0.2 ml aliquots of these cells are each mixed with a 0.1 

29 



BNSDOCID:<EP 0421919A2> 



EP 0 421 919 A2 



mi portion of the phage suspension and these nnixtures are incubated at room temperature for half an hour. 
Then 3 ml of 0.7 % agarose in LM-medium at 47*" C are added, briefly vortexed and immediately ptated on 
LM agar plates. The plates are incubated overnight at 37* C and chilled for 2 h at 4' C, 

From each plate two replicas are made according to the Benton and Davis plaque hybridization method 

5 (ref. 7). The first filter (Schleicher and Schueil BASS) is placed on top of the plate for 1 min, the second 
replica for 2 min and the position of the replicas is marked using Jndia ink. After removing the filters they 
are placed in a dish containing 100 ml of a denaturing solution (1 M NaCI. 0.5 M NaOH) for 0.5 min, and 
then for 1 min in 100 ml neutralizing solution (0.5 M Tris-HCI pH 7.5. 1.5 M NaCI). The filters are transferred 
to a dish containing 3xSSC, are gently rubbed with a gloved hand to remove bacterial debris and are rinsed 

70 with 3xSSC. The filters are blotted, dried for 10 min at room temperature and baked on Whatman 3 MM 
paper in an oven at 80* C for 2 h. 

The baked filters are wetted in 3xSSC, washed in this solution for 1 h at room temperature and then 
transferred to a dish containing 250 ml prewarmed (65 'C) prehybridization mi>cture which consists of 
2xSSC if oligonucleotide mixture HR6195 and HR6196 is used and of 1xSSC if oligonucleotide mixture 

75 HR6298 is used, further of 10x Denhardt's (0.2 % BSA, Boehringer fraction V; 0.2 Ficoll 400, Pharmacia; 0.2 
% poiyvinylpyrrolidone-10, Sigma), 0.1 % SDS and 0.1 mg/ml sheared and freshly denatured herring sperm 
DNA. Prehybridization is performed for 5 h at 65* C in a shaking water bath. Next the filters are washed 
once for half an hour 'in 250 m! prewarmed (65 *C) hybridization mixture, which is identical to the 
prehybridization mixture, except for the omission of herring sperm DNA. Then the filters are placed in a 

20 different dish containing 150 ml of prewarmed (65* C) hybridization mixture to which the previously labeled 
combined oligonucleotide mixtures HR6195 and HR6196 (Example 3.1) or the labelled mixture HR6298 had 
been freshly added. 

In the case if HR6195 and HR6196 are used, the dish is placed in the shaking water bath at 65° C, the 
thermostate Is adjusted to 47" C and the hybridization is allowed to proceed for 14 h. The filters are washed 
25 once in 250 ml prewarmed (47 ' C) hybridization mixture for half an hour at 47* C, followed by washing at 
room temperature in two changes of 250 ml 2xSSC, each for 45 min. A stringent wash is performed by 
transferring the filters to a dish containing 250 ml of prewarmed (63 *C) 6xSSC, 0.05 sodium 
pyrophosphate, and next incubating for 30 min in a shaking water bath at 63' C. Then the filters are washed 
at room temperature in two changes of 2xSSC, for 30 min each. The filters are dried in the air, fixed on a 
30 support of Whatman 3MM paper, covered with plastic wrap and exposed to 
Kodak XAR5 film for three days at -60* C, using an intensifying screen. 

In this way, 6 positive signals are obtained from the six plates screened. Positive plaques are punched 
out with a sterile Pasteur pipette by carefully positioning the plates on the autoradiogram using the ink 
markers. The pieces of agar containing the positive plaques are added to 1 ml of SM and 2.5. ul of 

35 chloroform is added. The phages are allowed to diffuse from the agar at room temperature for 1 h vyith 
occasional vortexing and then incubated overnight at 4' C. The agar and bacterial cell debris are removed 
by centrifugation for 5 min, 2.5 ul of chloroform is added and the phage stocks are stored at 4' C. 

The positive clones are named \C to XG and XL Since phages are plated at high density, the positive 
plaques are purified twice by plating them at a low density and repeating the complete procedure of replica 

40 plating, hybridization and picking of positive plaques. 

In the case if the oligonucleotide mixture HR6298 Is used, the dish is placed into the shaking water bath 
at 65" C, the thermostat is adjusted to 52 "C and the hybridization is allowed to proceed for 14 h. Then the 
filters are washed once in 250 ml prewarmed (52° C) hybridization mixture for half an hour at 52° C followed 
by washing at room temperature in two changes of 250 ml 2 x SSC each for 45 min. ' Subsequently the 

45 stringent wash is performed by transfering the filters to a dish containing 250 ml of prewarmed (64* C) 4 x 
SSC, 0,05% (w/v) sodium pyrophosphate and next incubating it for 0.5 h in a shaking water bath at 64 °C. 
Then the filters are washed at room temperature in two changes of 2xSSC, for 30 min each. The filters are 
dried in the air, fixed on a support of Whatman 3MM paper, covered with plastic wrap and exposed to 
Kodak XAR5 film for three days at -60 ' C, using an intensifying screen. 

50 In this way nine positive signals are obtained from the six plates screened. Positive plaques are 
punched out with a sterile Pasteur pipette by carefully positioning the plates on the autoradiogram using the 
ink markers. The pieces of agar containing the positive plaques are added to 1 ml of SM and 2.5. ul of 
chloroform is added. The phages are allowed to diffuse from the agar at room temperature for 1 h with 
occasional vortexing and then incubated overnight at 4°C. The agar and bacterial cell debris are removed 

55 by centrifugation for 5 min. 2.5 ul of chloroform is added and the phage stocks are stored at 4' C. 

The positive clones are named PG I-X 1-9. The positive plaques are further purified by plating them at a 
low density using the 1.2 kbp Bam HI/Eco Ri fragment of pGW1803 (see Example 3.8) as a heterologous 
probe for hybridization which is carried out at 60 'C. whereas washing is performed in 2 x SSC, also at 
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60 'C. 



Example 3.5: Isolation of lannbda DNA 

To isolate DNA from the recombinant clones, phages are first amplified. For this purpose E. coii LE392 
host cells are grown to an optical density (600 nm) of 1.0 in LB-medium supplemented with IcTmlvrMgSO^ 
and 0.2 % maltose. Then 50 ul of the stocks of the purified phages are separately plated as described in 
Example 2.5. After an overnight incubation at 37'C the phages are eiuted from the nonconfluent plates by 

10 spreading 5 ml of SM over the plates and incubating for two hours with gentle shaking. The eiuted phages 
are harvested and 0.1 ml chloroform is added. The mixture is briefly vortexed and cellular debris is 
removed by centrifugation. The supernatants are recovered, chloroform is added to 0.3 % and the resulting 
plate lysate Is stored at 4* C. 

In order to obtain nearly confluent plates as starting material for the isolation of phage DNA, 10 ul 

/5 portions of the plate lysates are plated with E. coli LE392 host cells. After overnight incubation at 37* C the 
agarose top layer is scraped off from three Tiearly confluent plates. These layers are combined, 20 ml of 
SM and 0.4 ml of chloroform are added and the resulting mixture is shaken at 37* C for 30 min. Cellular 
debris and agarose are removed by centrifugation, the supernatant is recovered and its volume adjusted to 
18 mi with SM. An equal volume of 2 M NaCl. 20 % PEG6000 (BDH, Poole, GB) in SM is added and the 

20 solutions are mixed and placed on ice. After 75 min the phages are peiletted by centrifugation for 20 min at 
• 12000 X g at 4'C. The supernatant is decanted and the remaining fluid is removed with a Kleenex tissue. 
The pellet is resuspended in 3 ml SM and subsequently extracted with 3 ml of chloroform. The aqueous 
phase is treated with RNase A (67 ug/ml) and DNase I (33 ug/mi) for 20 min at 37' G. Then this mixture is 
extracted by adding 2 ml of phenol, vortexing, adding 1 ml of chloroform, vortexing again and separating 

25 the two phases by centrifugation. The aqueous phase is extracted twice more, with 3 ml of 
phenol/chloroform (1:1) and 3 ml of chloroform, respectively. Then the DNA is precipitated from the 
aqueous phase by the sequential addition of 0.3 ml 3 M sodium acetate buffer (pH 5.2) and 6 ml of ethanol. 
This mixture is left at 4' C for 16 h and then the DNA is recovered by centrifugation (10 min, 12000 x g. 
4'C). The pellet is dissolved in 0.4 ml of TE buffer, RNase A is added to 200 ug/ml, and incubated at 

30 37' 0 for 1 h. The DNA is precipitated, by the addition of 38 ul 3 M sodium acetate buffer (pH 5.2) and 0.8 
ml ethanol at 4'C for 1 h. The DNA is recovered by centrifugation and subsequently dissolved in 100 ul of 
TE buffer. 



35 Example 3.6: Restriction analysis of the A. niger N400 PGM and PGI lambda clones 



From the positive phages containing PGll gene sequences, XD and \E are selected for further analysis. 
First it is established by restriction analysis that both phages contain inserts which are derived from the 
same region of the A. niger genome and next a partial restriction map of XE is constructed. 

2 ug of phage DNA is digested with 20 units of EcoRI or BamHI or the combination of these enzymes 
in a volume of 100 ul for 3 h at 37*0 in the buffer recommended by the supplier (BRL) and in the 
presence of 0.8 mg/ml RNase A. Then 10 ul of a 3 M sodium acetate buffer (pH 5.2) is added and the 
mixture is extracted with 80 ui of chloroform. The DNA is precipitated from the aqueous phase by the 
addition of 250 ul of ethanol and the mixture is placed on ice for 1 h. The DNA is collected by 
centrifugation, dissolved in 25 ul of 1 x sample buffer and heated at 65* C for 10 min. The samples are run 
on a 0.7 % agarose gel using 1 ug of lambda DNA (BRL) digested with Hindlll as size markers. The gel is 
photographed on a UV transilluminator using polaroid 667 film. The DNA is transferred to a Schleicher and 
Schuell BA85 nitrocellulose membrane according to the method of Southern (1975) as described by 
Maniatis et aL pp. 382-386; (ref. 6). Then the membrane is baked and hybridized with the combined, 
labelled oligonucleotide mixtures HR6195 and HR6196 as described in Example 3.3. 

Fragment lengths are calculated from the photograph and autoradiograms by comparison with the 
known marker bands. The sizes of the fragments smaller than 5 kbp obtained after the digestion of DNA 
isolated from phages XD and XE with EcoRl, BamHI and the combination of these enzymes, respectively, 
are given in Table HI. With the resolving power of the electrophoresis system employed, it is not meaningful 
to compare fragments with a greater size. As judged from the presence of a few bands with a reduced 
intensity in the EcoRI digests, which are indicated in Table 111, EcoRI has not completely digested the DNA. 
The fragments /ihat are present in the EcoRI/BamHI double digest, but not in either one of the single 
digests, have intensities comparable with those of BamHI fragments in the same size range. Therefore they 
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must represent completely digested products. The fragments that hybridize with the combined 
oligonucleotide mixtures HR6195 and HR6196 are, in the case of XE, a fragment larger than 10 kbp in the 
BamHI digest, a 7.4 kbp EcoRl fragment and a fragment larger than 10 kbp in the EcoRI digest and a 2.8 
kbp fragment as well as a fragment larger than 10 kbp in the BamHI/EcoRI double digest. 

5 In the case of XD the fragments that hybridize with the combined oligonucleotide mixtures HR6195 and 

HR6196 are: a fragment larger than 10 kbp in the BamHI digest. a^S.B kbp fragment and a fragment larger 
than 10 kbp in the EcoRI digest and a 1.2 kbp fragment as well as a fragment larger than 10 kbp in the 
BamHI/EcoRI double digest When more fragments are observed, e.g. in the digests with EcoRI, the larger 
fragment is regarded as a product which arises from partial cleavage. The hybridization of fragments of 

10 equal size obtained from XD and XE with the oligonucleotide mixtures is not observed. 

From the differences between phages XD and XE enumerated above it is concluded that phages XD and 
XE are not identical. However, Table III lists two EcoRI fragments, three BamHi fragments and at least four 
BamHt-EcoR! fragments that have the same size in XD and XE. These fragments must originate from the 
inserts of these phages, since the vector does not contain EcoRI or BamHI sites, except for the BamHI sites 

15 that were used to insert the fragments of Aspergillus DNA in the vector and the EcoRI sites flanking these 
BamHi sites and thus flanking the insert as well (Frischauf et aL . ref. 9). Therefore, it is concluded that the 
inserts of phage XD and XE have been derived from the same region of the A. niger genome. In 
combination with the fact that the relevant hybridizing BamHI-EcoRl fragment from XD is smaller than the 
corresponding fragment from xE. e.g. 1.2 kbp versus 2.8 kbp, and under the assumption that the observed 

20 differences in the restriction pattern are not the result of a cloning artefact, it follows that the EcoRI site of 
the hybridizing 1.2 kbp BarnHI-EcoRI fragment of XD is not derived from A. niger DNA and is one of the 
EcoRi sites that flank the insert of this phage. In addition, it follows that irTone direction the insert of XD 
extends at the most 1.2 kbp beyond the sequence that hybridizes with the oligonucleotide mixture. 
Moreover, it is inferred that the sequence of XE that hybridizes with the oligonucleotide mixture is located 

25 within 1 .2 kbp from the BamHI site at the border of the hybridizing 2.8 kbp BamHI-EcoRI fragment (Fig. 1). 
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Table HI: The approximate size of the fragments (in kbp) smaller than 5 kbp 

obtained after the digestion of DNA isolated from phages XD and XE with 
the restriction enzymes EcoRI, BamHI and the combination of these 
enzymes, respectively. (P) denotes fragments that probably result from 
partial cleavage. 
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40 



45 



50 



55 



EcoRI 



XD 



3.8 



3 

2 



(P) 



0. 68 



XE 



3 
3 



8 

5 (P) 



2.4 



0.98 



BamHI 



XD 



2,4 
2.2 
1.6 



0.62 



Xe 



2.2 
1 . 6 



0. 62 



EcoRI 



BamHI 



4 . 4 



2 , 
2. 
1, 
1 , 
1 



0.84 
0.72 
0 . 63 
0. 62 



XE 
4 . 4 



2.8 



2 
1 
1 



0 
0 



84 
72 



0 .62 



32 



BNSDOCID:<EP 0421919A2> 



EP 0 421 919 A2 



In order to construct a partial restriction map of XE, phage DNA is digested with EcoRI, Xbal, Htndlll 
and all possible combinations- of these enzymes. This is carried out in two steps. First 6 ug of DNA is 
incubated for 105 min at 37* C either in the presence or in the absence of 200 U of EcoRI, in a volume of 
200 ul containing the buffer recommended by the supplier (BRL) plus RNase A (2 mg/ml). Then the 

5 reaction mixtures are extracted with 100 ul phenol/chloroform (1:1) and then with 100 ul of chloroform. DNA 
is precipitated from the aqueous phase by the addition of 0.1 volume of a 3M sodium acetate buffer and 2 
volumes of ethanoL After standing on ice for 10 min, the precipitates are collected by centrifugation. The 
pellets are dried in the air and dissolved in 100 ul TE buffer. Then these solutions are used for subsequent 
incubations, either in the absence of restriction enzyme (EcoRI digested material only) or in the presence of 

10 Hindlll, Xbal or the combination of these enzymes. Incubations are carried out for 2 h at 37* C in a volume 
of 20 ul. containing 10 ul of the appropriate DNA solution, RNase A (2 mg/ml), 10 U of the appropriate 
restriction enzyme and the buffer recommended by the supplier (BRL). The incubation is stopped by the 
addition of 5 ul of 5 times concentrated sample buffer and subsequently the samples are analyzed as 
described above. 

15 On the autoradiogram a single hybridizing band, which is larger than 10 kbp. is detected in the digests 
with Hindlll, Xbal or the combination of these two enzymes. A band of about the same size is present in all 
other digests, although at a lower intensity, which indicates that the enzymes have not completely restricted 
the DNA. The second and most strongly hybridizing band in the EcoRI digest represents a 7.4 kbp 
fragment. Again indicating incomplete digestion, this band is present in the digests with EcoRI and a 

20 second enzyme, although at a lower intensity. In addition, the EcoRI/Hindltl double digest and the 
EcoRI/Hindlll/Xbal triple digest give a hybridizing fragment of 3.6 k-bp. A 4.1 kbp hybridizing fragment is 
detected in the double digest with the enzymes EcoRI and Xbal. This fragment is also present in the triple 
digest with the enzymes EcoRI. Hindlll and Xbal, but at very low intensity. In the latter case, the 4.1 kbp 
fragment is regarded as a partial cleavage product. 

25 It is concluded that the 7.4 kbp EcoRI fragment that hybridizes with the combined oligonucleotide 
mixtures HR6195 and HR6196 can be cleaved with either one of the enzymes Xbal, Hindlll or BamHl and 
that the location of the cleavage site of these enzymes must be as shown in Figure 1. This figure only gives 
a partial restriction map, e.g. the 7.4 kbp EcoRI fragment might still contain more than one cleavage site for 
either one of the enzymes Hindlll, Xbal and BamHl. in which case only the site closest to the specific 

30 sequence that hybridizes with the combined oligonucleotide mixtures HR6195 and HR6196 is shown. 

With phage XD no Xbal or Hindlll digestion is performed, but it is assumed that the Xbal and Hindlll 
restriction sites are located as in phage xE. 

From the nine positive clones, which are carrying PGI gene sequences, the four phages PG I -X4, X5, 
X6 and X7 are selected for further analysis. First it is established that the phages contain inserts derived 

35 from the same region of the A. niger genome and next a partial restriction map of X5 is made as described 
above for the PGll gene sequence containing phages D and E. The DNA is digested with restriction 
enzymes (for enzymes see Table iV) as recommended by the enzyme suppliers. The fragments are 
separated on an agarose gel and blotted onto nitrocellulose as described above. Then the membrane is 
baked and hybridized with a previously nick-translated 1.2 Kb Bam Hl/Eco Rl fragment of the structural 

40 gene coding for PG It. This fragment is derived from the plasmid pQW1803 which is described in Example 
3.8, The fragment is isolated from slices of an agarose gel and 50 ng of the fragment is nick-transiated as 
described by Maniatis et a!, (pp. 109-112; ref. 6). Before adding it to the hybridization mixture the nick- 
translated DNA is denatured for 10 min in a boiling water bath. The nitrocellulose filters are hybridzed with 
the radioactive probe as described in Example 3.8 except that the prehybridization and hybridization 

45 temperature is 60 'C. Then the membranes are washed at 60 'C using a' series of prewarmed buffers. First 
the membranes are rinsed in 6 x SSG and washed twice for 0.5 h in hybridization buffer. Then the filters are 
sequentially washed in 4 x SSG and 2 x SSC for 30 min per buffer. These buffers contain both 0.1% SDS 
and 0.1% Na4P2O7Xl0H2O. After the last wash the membranes are shortly rinsed in O.lxSSC and then they 
are allowed to dry and are subsequently exposed to X-ray film. 

50 The probe used represents a small part of the PG II promoter region (200 bp) and a large part of the 
coding sequence. Under the hybridization conditions described this probe gives strong signals with ail the 
phages isolated above (see Example 3.4) containing PGI gene sequences^ 

Fragment lengths are calculated from the photograph and autoradiograms by comparison with the 
known marker bands. In Table IV the hybridizing fragments and their approximate sizes are listed. 

55 
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Table IV: The approximate size of the hybridizing fragments (kbp) obtained after 
digestion of DNA isolated from PG I with the restriction enzymes 
EcoRI, BamHI. Xho l and combinations of these enzymes as well as by 
Bam HI/Bglll 

Phage 



Restriction enzyme 


X4 


X5 


xe 


XI 


BamHI 


8.6 


8 . 6 


^ 23 


8.6 


EcoRI 


6.4 


6.4 


6.4 


6.4 


EcoRI/BamHI 


6.4 


6-4 


6.4 


6.4 


Xhol 


> 23 


> 23 


> 23 


> 23 


XhoI/EcoRI 


2 . 0 


2.0 


2.0 


2 . 0 


BamHI/BcTlII 


7.5 


7.5 


> 23 


7.5 



From Table IV It is clear that al! phages contain an Eco Rl fragment of 6.4 kbp and a Xho l/Eco R! 
fragment of 2,0 kbp whereas an Eco Rl/Bam HI double digest also results in a 6.4 kbp fragment. The 
phages X4, X5 and X7 have also identical fragments when digested with Bam HI (8.6 kbp) and Bam Hl/Bgl II 
(7.5 kbp). The vector does not contain Bam HI sites except for the Bam HI sites used to insert the 
fragments of Aspergillus DNA (Frischauf et al ref. 9). The inserts of these three phages are therefore 
considered to be derived from the same region of the A. niger genome. 

In order to construct a partial restriction map X5 phTge DNA is also digested with Kpn 1, Sma I, sst I and 
Sal I and with Bam HI in combination with these enzymes. Moreover, a Bgl ll/ Xho I digest is analyzed. All 
these incubations are also carried out for 3 h at 37*0. except for Sma I (30 C), in a volume of 20 ul 
containing 1 ug phage DNA. The fragments hybridizing vvith the PG II probe at 60° C are listed in Table V. 



35 Table V: The approximate size of the hybridizing fragments (kbp) obtained after digestion 
of DNA isolated from PG I-?.5 with the restriction enzymes Bgl lL Kpn l, Smal, Ssil, Sai l 
and %vith combinations of Bam HI with these enzymes or with Xho l. Also a double digest 
of BgllT/XhoI is listed. 

40 

Fragment length 

Fragment length 



Restriction enzyme 


(kbp) 


Restriction enzyme 


(kbp) 


BamHI 


8 . 6 






Bqlll 


9.7 


BamHI/Bqlll 


7.5 


Kpn I 


5.1; 2.7 


BamHI/XhoI 


4 . 2 


EcoRI 


6.4 


Bqlll/Xhol 


3.0 


Smal 


4.7 


BamHI/KonI 


4.8; 2.8 


SstI 


9.9 


BamHI/SstI 


8.3 


Sail 


8.9 


BamHI/Sall 


8.5 
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A restriction map of PG I-X5 can be designed from the hybridizing fragments from which it can be 
concluded that the gene coding for polygalacturonase I Is located on the 8.6 kbp Bam HI fragment. 



5 Example 3.7: The construction of pGWI 800 and pGW1 803 

The relevant EcoRI-Xbal restriction fragments of phages XD and XE are inserted into pEMBL vectors 
(Dente and Cortese, ref. 10), resulting in plasmids pGW1803 and pGWIBOO, respectively. 

From Fig. 1 and Example 3.5 it is inferred that phage XD must have a 2.5 kbp EcoRI-Xbal fragment that 

10 is identical to part of the 4,1 kbp EcoRI-Xbal fragment of phage XE. 6 ug of XD DNA are digested with 60 U 
of Xbai for 2 h at 37" C in a volume of 200 ul in the buffer recommended by BRL plus RNase A (1.5 
mg/ml). Then the NaCI concentration is adjusted to 140 mM, concentrated reaction buffer is added to 
compensate for the increase of volume, 60 U of EcoRI are added and the incubation is continued for 2.5 h 
in a volume of 230 ul. Then the reaction mixture is extracted with 100 ul of chloroform and the DNA is 

15 precipitated from the aqueous phase by the addition of 0.1 volume of 3 M sodium acetate buffer (pH 5.2) 
and 2 volumes of ethanoL After incubation at 4* C for 16 h the DNA is recovered by centrifugation, the 
pellet is dried in the air and then dissolved in sample buffer The restriction fragments are separated on a 
0.6 % low melting point agarose (BRL) gel in 0.5 x TBE buffer containing ethidium bromide. The DNA 
fragments are visualized under UV light and a gel slice containing the 2,5 kbp EcoRI-Xbal fragment is 

20 isolated. 

Phage XE DNA is incubated with the restriction enzymes Xbal and EcoRI, essentially as described 
above. Following extraction with chloroform, the DNA is precipitated, pelletted by centrifugation, dissolved in 
sample buffer and subjected to electrophoresis on a 0.6 % agarose gel in 1 x TBE buffer, A gel slice 
containing the 4,1. kbp Xbal-EcoRI fragment is recovered and it is stored at -20 *G. A plug of silanized 

25 glass wool is placed at the bottom of a 0.5 mi micro-centrifuge test tube containing 60 ul of high TE buffer 
(100 mM Tris-HCI, pH 8.0, 10 mM EDTA). The gel slice is thawed, put in the test tube and the tube is 
frozen in liquid nitrogen. A small hole is punched in the bottom of the small tube and it is placed in a 1.5 mi 
micro-centrifuge test tube and the buffer and DNA from the gel slice are collected by centrifugation. The 
remnants of the gel slice are resuspended in 50 ul of high TE buffer, this suspension is frozen again in 

30 liquid nitrogen and the buffer is collected by centrifugation. Then the eluates are combined and extracted 
with 100 ul of chloroform. The DNA is precipitated from the aqueous phase, collected by centrifugation and 
dissolved in 40 ul of TE buffer. The DNA concentration is estimated by agarose gel electrophoresis 
followed by visualisation of the band under UV light, using 1 ug of lambda DNA (BRL) digested with Hindlll 
as a reference. 

35 pEMBLlS and pEMBL19 vectors are prepared by sequential digestion with Xbal and EcoRI, under the 
conditions recommended by the supplier (BRL), The extraction of DNA with phenol, phenol/chloroform (1:1) 
and chloroform and the precipitation of DNA with ethanol are carried out as described by Maniatis (ref. 6), 
except for the omission of isoamylalcohol from the chloroform solution and the temperature used for the 
precipitation of DNA, is 4'C. 

40 DNA fragments are ligated to the appropriate vector in a reaction volume of 25 ul, containing the buffer 
recommended by BRL plus ATP (1 mM), 1.5 U of T4 DNA ligase (BRL), 100 ng of vector DNA, which has 
been prepared as described above, and the relevant restriction fragment. Xbal and EcoRI digested 
pEMBLIS is used for the construction of pGWISOO plus an equimolar amount of the purified 4.1 Xbal-EcoRl 
fragment of XE. 

45 The reaction mixture is incubated for 16 h at 16*0 and then the reaction is stopped by the addition of 
125 ul of distilled water and freezing. Xbal and EcoRI digested pEMBLIG used for the construction of 
pGW1803. In this case, the slice of agarose that contains the 2.5 kpb Xbal-EcoRI fragment of D is molten at 
60*0 and 4 ul, containing ca. 15 ng of DNA, are added to 11 ul of distilled water, after which the other 
components of the reaction mixture are added. The ligation reaction is allowed to proceed for 14 h at 20* C. 

50 then 1 additional unit of T4 DNA ligase is added. The reaction is continued for another 7 hrs and then the 
reaction is stopped as described above, 

50 ul of the resulting ligation reaction mixtures are used to transform E. coli as described in the BRL 
Ml 3 cloning/dideoxy sequencing manual (pp. 30-33; ref. 11) except that E.'cbirDH5aF' and E. coli JM 109 
are grown to an OD550 value of 0.7 and 0.9, respectively, before placing tTie flasks on ice. The"former strain 

55 is transformed using the ligation reaction mixture with the fragment of XD, the latter strain is transformed 
using the ligation reaction mixture with the fragment of XE. After the heat shock the cells are incubated on 
ice for two min and then 1 ml of LB medium is added and the cells are incubated at 37' C for 1 h. The ceils 
are pelletted by low speed centrifugation. 1 ml of the supernatant is removed and the cells are gently 

35 

BNSDOClD:<EP 0421919A2> 



EP 0 421 919 A2 



resuspended. Then the cells are plated on LB agar plates containing 100 ug/ml annpicillin and on which an 
IPTG/X-gal solution (= 200 ii\ H2O, 30 ul dimethylformamide containing 2 % X-ga) and 20 ut of a 24 mg/ml 
IPTG solution in water) had been spread. The plates are incubated overnight at 37" C. 

Several single white colonies are used to prepare overnight cultures in LB nnediunn supplemented with 

5 0.1 % glucose and 75 ug/mi ampicillin. These cultures are used to isolate plasmid, using the miniprep 
method of Holmes and Quigley (ref. 12). The plasmids are digested with several restriction enzymes, 
according to the recommendations of the supplier (BRL) and in the presence of RNase A (0.5 mg/ml). and 
the products are analyzed on an agarose gel. Plasmids that give rise to Xbal-EcoRI, BamHI-EcoRI and 
Hindlll fragments of the expected size are selected and the E. coli cells harbouring them are kept on 

70 glycerol at -20* C. 

Two new plasmids are constructed. pGW1800 is the 4.1 kbp Xbal-EcoRI fragment of phage XE inserted 
into the vector pEMBLIS. pGW1803 is the 2.5 kbp Xbal-EcoRI fragment of phage XD inserted into the 
vector pEMBL19. 

Example 3.8: Restriction analysis of pGW1800 and pGW1803 



The relationship between pGWIBOO and pGW1803 is confirmed by extended restriction analysis. The 
specific sequence that hybridizes with the combined oligonucleotides HR6195 and HR6196 is designated to 

20 a specific region of pGWI803. The absence of cloning artifacts, such as deletions or ligation of A. niger 
derived DNA fragments, is established by comparison of restriction fragments obtained from plaimid 
pGWIBOO and from A. niger N400 DNA, respectively. 

pGW1800 and pGW1803 are propagated in E. coU strain JM109 and DHSap', respectively, and plasmid 
DNA is recovered from 250 ml overnight cultureslnTB medium supplemented with 0.1 % glucose and 100 

25 ug/m\ ampicillin as described by Maniatis et ai. (pp 90-91; ref. 6) and finally resuspended in TE buffer. 
Since pGWiSOO is also used to transform A7 "niger . it is purified by banding in a cesium chloride gradient. 
To 2.5 mi DNA solution in TE. 3.3 g CsCl and 1 ml ethidium bromide (10 mg/m! in water) are added. 
Centrifugation takes place in a Beckman® VTi 65.2 rotor at 20* C for 16 h at 45,000 rpm. Of the two 
fluorescent bands visible under UV light, the lower one containing covalently closed circular plasmid DNA is 

30 recovered by side puncturing of the tube. The DNA solution (approx. 1 ml) is extracted 5 times with water 
saturated butano! to remove the ethidium bromide. Then the volume is adjusted to 15 ml with water and the 
DNA is precipitated by the addition of 30 ml of ethanol. The DNA is coilected by centrifugation and next the 
pellet is washed once with 75 % ethanol and subsequently it is dissolved in TE buffer and stored at -20 ' C. 
pGW1803 is not isolated from a CsCl gradient, but it is subjected to a faster procedure. RNase A is added 

35 to 4 ml DNA solution to a concentration of 100 ag/ml and this mixture is incubated at 37'C for 1 h and 
subsequently extracted with an equal volume of phenol, phenol/chloroform (1:1) and chloroform. Then the 
DNA is precipitated from the aqueous phase by the addition of 0.4 ml 3M sodium acetate buffer pH 5.2 and 
8 ml ethanol. The DNA is collected by centrifugation, the resulting pellet is washed with 75 % ethanol and 
dried in the air and subsequently dissolved in TE buffer. 

40 niger DNA is isolated by a slightly modified procedure used to isolate plant RNA (Slater, ref. 21). 

Mycelium is washed with saline, frozen in liquid nitrogen and 2.5 g is disrupted using a microdismembrator 
(Braun). The mycelial powder is extracted with freshly prepared extraction buffer. The extraction buffer is 
prepared as follows: 5 ml tri-isopropylnaphthalene sulphonic acid (TNS, 20 mg/ml) is mixed with 5 ml p- 
aminosaiicylic acid (PAS) (120 mg/ml) and 2.5 ml 5 x RNB buffer (5 x RNB contains 121.1 g Tris. 73.04 g 

45 NaCI and 95.1 EGTA in 1 I, pH 8.5). After the addition of 7.5 mi phenol, the extraction buffer is equilibrated 
for 10 min at 55 *C. The warm buffer is then added to the mycelial powder and the suspension is mixed for 
2 min. Then 5 ml chloroform is added and mixed for 2 min. The phases are separated by centrifugation (10 
min, 10.000 x g) and the aqueous phase is extracted once more with 10 ml of phenol/chloroform (1:1) and 
then twice with chloroform. 

50 The aqueous phase contains both RNA and DNA. The DNA is precipitated with 2 vol ethanol at room 
temperature and collected by centrifugation (10 min, 10.000 x g). washed twice by redissolving in distilfed 
sterile water and precipitating it again with ethanol. RNA is removed by adding RNase A (20 M-g/ml) to the 
final solution. 

In order to construct physical maps of pGWIBOO and pGW1803. several reaction mixtures are prepared 
55 that contain approximately 1 ug of plasmid DNA, RNase A in concentration of 1 mg/ml (only in the case of 
pGW1803). the relevant buffer as recommended by BRL and 10 units of a restriction enzyme, whereas 
different enzymes are chosen for the separate reaction mixtures. In several cases a combination of two 
different enzymes is chosen, or plasmid DNA is restricted with a certain enzyme or combination of 
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enzymes, followed by precipitation of the DNA in the presence of sodium acetate and ethanol and 
subsequent recollection of the DNA by centrifugation, which material is then used as substrate for a second 
or third restriction enzyme. The reaction mixtures are incubated at 37 'C for 1 h, then the reaction is 
stopped by the addition of 5 times concentrated sample buffer and next the reaction products are analyzed 

5 on an 1 % agarose gel in TBE buffer. From the calculated size of the specific fragments the position of the 
individual restriction enzyme sites relative to a reference point, which is arbitrarily chosen at the unique 
Xbal site, is inferred. The restriction map of pGWISOO is given in 'figure 2. The restriction map of the A. 
niger derived DNA of pGW1803 (not shown) is identical with the map of pGWISOO from the Xbal site at 
position 1 to the Hindi site at position 2000. However, pGW1803 does not contain a Bglll site, which is 

10 present at position 2400 in pGWISOO. The 0.4 kbp Hincll-Bglll fragment of pGW1800 showed the same 
electrophoretic mobility as the HincII-EcoRI fragment of pGW1803 which is located at the end of the A. 
niger derived insert DNA. It is concluded that the A. niger derived DNA of pGW1803 is identical with the K 
niger derived DNA of pGW1800 between the Xbal site at position 1 and a location very close to, or at, but 
not including, the Bglll site at position 2400. Thus, the inferred size of the EcoRI-Xbal fragment of phage 

15 XD. 2,5 kbp (Example 3.6), presents a slight over-estimation. 

In order to locate the specific sequence that hybridizes with the combined oligonucleotide mixtures 
HR6195 and HR6196, pGW1803 DNA is restricted and the resulting fragments are separated as described 
above. Then the DNA is transferred to a nitrocellulose membrane and is allowed to hybridize with the 
combined and labelled oligonucleotide mixtures HR6195 and HR6196, as described hereinbefore. The 

20 fragments that hybridize with the combined oligonucleotide mixtures HR6195 and. HR6196 are a 0.64 kbp 
Hindi fragment in the Hindi digest and a 0.62 kbp NcoI-EcoRI fragment in the Ncol/EcoRI double digest. It 
is concluded that the specific sequence that hybridizes with the combined oligonucleotide mixtures HR6195 
and HR6196 is located between the Ncol site at position 1750 and the Hindi site at position 2000 of the 
restriction map of pGW1803, which coordinates correspond with the same sequence on the restriction map 

25 of pGWISOO. 

The absence of cloning artifacts is established by comparing restriction fragments from pGWISOO with 
restriction fragments obtained from A. niger N400 DNA. Digests of pGWISOO are prepared as described 
above. Digests of chromosomal DNA of strain N400 are made at 37' G in a reaction volume of 200 ul 
consisting of 2 ug DNA. 0.5 mg/ml RNase A. a reaction buffer supplied by BRL (REact Buffer 2 for Xhol, 

30 Xbal. EcoRV and Bglll. REact Buffer 4 for Kpnl and REact Buffer 3 for EcoRi and Sail) and 80 units for 
each restriction enzyme used. The reaction mixture is incubated for 2 h and then extracted with 100 ul of 
chloroform. Subsequently the DNA is precipitated from the aqueous phase by the addition of 20 ul of a 3 M 
sodium acetate buffer pH 5.2 and 0.5 ml ethanol followed by standing on ice for 45 min. Then the DNA is 
recollected by centrifugation and dried on the air. The DNA is dissolved in sample buffer and these samples 

35 and the digests of pGWISOO and lambda DNA size markers are loaded on an 0.7 % agarose gel in TBE 
buffer. After electrophoresis the resulting pattern is documented and the DNA in the lanes with the digests 
of A. niger chromosomal DNA is transferred to a nitrocellulose membrane, both as described hereinbefore. 
The membrane is baked and cut in two parts, which are hybridized with different probes. 

The probes used for hybridization are the nick-translated 1.45 kbp and 2.05 kbp EcoRV-Bglll fragments 

40 of pGWISOO. The fragments are previously isolated from slices of an agarose gel as described hereinbefore 
and 100 ng of the fragments are nick-translated in separate reactions as described by Maniatis et al. (pp 
109-112; ref. 6). Before adding it to the hybridization mixture the nick-translated DNA is denatured lor 10 
min in a boiling water-bath. 

The baked nitrocellulose filters are wetted in 3xSSG and are then transferred to separate dishes 

45 containing 100 ml prewarmed (68' C) hybridization buffer which consists of 6xSSC, 10 mM EDTA, 0.1 
mg/ml freshly added sheared and denatured herring sperm DNA. 0.1 % Na^PsOTXIOHaO. 0.5 % SDS and 
5x Denhardt's (0.1 % BSA, Boehringer fraction V, 0.1 % Ficoll 400, Pharmacia; 0.1 % polyvinylpyrrolidone- 
10, Sigma). Prehybridization is for 2 h at 68" C in a shaking water bath and then the hybridization buffer is 
replaced with fresh buffer (75 ml) to which a nick-translated probe is subsequently added. The hybridization 

50 is allowed to proceed at 68* C for at least 16 h Then the membranes are washed at 68* 0, using a series of 
prewarmed buffers that all contain 0.1 % SDS and 0.1 % Na4P2O7Xl0H2O, but. with a decreasing quantity 
of SSC. First, the membranes are rinsed in 4xSSC and then they are washed in 4xSSC twice for 10 min. 
Then the filters are sequentially washed in 4x880, 2xSSG, 0.5xSSC, 0,2xSSC and O.lxSSC for 30 min. per 
buffer. After the last wash the membranes are shortly rinsed in O.lxSSC and then they are allowed to dry 

55 and they are subsequently exposed to X-ray film. 

The membrane probed with the 1.45 kbp EcoRV-Bglll fragment contains the following hybridizing 
fragments: a 2.4 kbp fragment in the Xhol digest, a 2.8 kbp and a 1.2 kbp fragment in the Kpnl digest and a 
1.3 kbp fragment in the Bglll/EcoRV double digest. The membrane probed with the 2.05 kbp EcoRV-BglH 
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fragment contains the following fragments: a large fragment of approximately 1 1 kbp in the EcoRI/Sal! 
double digest, 2.3 kbp and 2.1 kbp fragments in the Xhol digest. 2.3 kbp and 1.4 kbp fragments in the 
Xho/Xbal double digest, 1.2 kbp fragments in the Kpnl digest and a 2.0 kbp fragment in the EcoRV/Bglll 
double digest. 

5 For four of the hybridizing fragments only the existence and a minimum size can be inferred from the 

restriction map of pGW1800, i.e. fragments which have homology with a probe used but of which one end 
is outside the sequence which is present in pGWIBOO. These fragments are. in the case of the 2.05 kbp 
EcoRV-Bglll probe, the fragment in the EcoRI/Sall digest, the 2.1 kbp fragment in the Xhol digest and the 
3.2 kbp fragment in the Kpnl digest, and . In the case of the 1.45 kbp EcoRI-Bgill probe, the 2,8 kbp 

10 fragment in the Kpnl digest. These fragments are all larger than the minimum size which is inferred from 
the restriction map of pGWISOO, The different hybridizing fragments of genomic A. niger DNA can be 
correlated with fragments present in the digests of pGW1800 and ail have a calculated ^ize which is very 
close to the size which is inferred from the restriction map of pGWISOO. It is concluded that the A. niger 
derived DNA of pGWIBOO is a faithful representation of part of the A. niger N400 genome. 

75 

Example 3.9: The construction of pGW1900 and pGW1902 and their restriction analysis. 



The 8.6 kbp Bam HI restriction fragment of phage PGI-\7 which hybridizes with PGII coding sequences 

20 (see Table IV, Example 3.6) is inserted into the pUC9 vector (Vieira, J. and Messing. J., 1982, Gene 19, 
259-268) resulting in plasmids pGW1900 (see Figure 3) and pGW1901 which have the insert in opposite 
orientation. To 1.5 ug PGI-X7 DNA in 5 ul TE buffer, 1 ui spermidine of a 10 mM stock solution, 20 U Bam 
HI, the reaction buffer recommended by BRL and sterife distilled water are added to a final volume of SOuL 
To the digested samples one quarter (v/v) of loading buffer is added (loading buffer = 0.25% bromophenol 

25 blue; 0.25% xylene cyanol and 15% Ficoll 400 in H2O). The reaction fragments are separated on a 0.6% 
agarose gel in 1 x TAE buffer containing ethidium bromide (1 ug/ml) (50 x TAE buffer = 242 g Tris, 57.1 
ml acetic acid and 100 ml 0.5 M EDTA pH 8.0 per titer). 

The DNA fragments are visualized under UV tight and a gel slice containing the 8.6 kbp Bam HI 
fragment is isolated. DNA fragments are eiectrophoretically isolated according to conventional rnethbds 

30 Elution is performed at 100 V during 1 h. The DNA is collected and precipitated with 2 vol of.ethanol. After 
centrifugation in an Eppendorf centrifuge for 30 min the precipitate is collected, dried in a Speedvac and 
dissolved in 10 al TE buffer. Vector pLIC9 (1 ug) is linearized using 10 U Bam HI for 37° C during 1.5 h 
using the recommended BRL buffer in a total reaction volume of 20 ul. Subsequently 1 al CIP solution 
(approx. 2U) is added and the mixture is incubated for 30 min at 37" C. The linearized vector is also 

35 prepared by electrophoresis. It is dissolved in 20 ul TE which corresponds with a DNA concentration of 
approx. 50 ng/ul. 

In the ligation reaction one ul of the linearized and CIP treated pUC9 vector (50 ng) and 4 al of the 
8am HI fragment (approx. 100 ng) are ligated with 1.2 U T4 DNA ligase in appropriate ligase buffer. The 
final reaction volume is 10 ml. The reaction is allowed to proceed for 1 4 h at 1 4° C and then the mixture is 

40 diluted with TE to 50 ul. 

50 ul of the resulting ligation reaction mixtures are used to transform E. coii as described in the BRL 
M13 cloning/dideoxy sequencing manual (pp. 30-33; ref, 11) except thatT."coli DH5aF' is grown to an 
ODsso value of 0.5 to 0.7 before placing the flasks on ice. After the heat shocklhe cells are incubated on 
ice for two min and then 1 ml of LB medium is added and the cells are incubated at 37* C for 1 h. The ceils 

45 are pelletted by low speed centrifugation, 1 ml of the supernatant is removed and the cells are gently 
resuspended. Then the cells are plated on LB agar plates containing 100 ug/mi ampicillin and on which an 
IPTG/X-gal solution (= 200 ul H2O, 30 ul dimethylformamide containing 2 % X-gal and 20 ul of a 24 mg/ml 
IPTG solution in water) had been spread. The plates are incubated overnight at 37*0. Several single white 
colonies are used . to prepare overnight cultures in LB medium supplemented with 0.1 % glucose and 75 

50 ug/ml ampicillin. These cultures are used to isolate plasmid, using the miniprep method of Holmes and 
Quigley (ref. 12). The plasmids are digested with several restriction enzymes, according to the rec- 
ommendations of the supplier (BRL) and in the presence of RNase A (0.5 mg/ml). and the products are 
analyzed on an agarose gel. Plasmids that give rise to BamHI fragments of the expected size are selected 
and the E. coli ceils harbouring them are kept on glycerol at -20* C. The new plasmids pGW1900 (shown in 

55 Figure 3) and pGWl901 carrying the insert in the opposite direction are used for further experiments. 

pGW1900 is propagated in E. col[ strain DH5aF' and plasmid DNA is recovered from 250 ml overnight 
cultures in LB medium supplemented with 0.1 % glucose and 100 ug/ml ampicillin as described by 
Maniatis et aL (pp 90-91; ref. 6) and finally resuspended in TE buffer. Since pGW1900 is also used to 

38 



8NSDOCID:<EP 0421919A2> 



EP 0 421 919 A2 



transform A. niger . it is purified by banding in a cesium chloride gradient. To 2.5 nnl DNA solution in TE, 
3.3 g CsCI and 1 ml ethidium bromide (10 mg/ml in water) are added. Centrifugation is performed in a 
Beckman® VTi 65.2 rotor at 20* C for 16 h at 45,000 rpm. Of the two fluorescent bands visible under UV 
light, the lower one containing covalently closed circular piasmid DNA is recovered by side puncturing of 

5 the tube. The DNA solution (approx. 1 ml) is extracted 5 times with water saturated butanol to remove the 
ethidium bromide. Then the volume is adjusted to 15 ml with water and the DNA Is precipitated by the 
addition of 30 ml of ethanol. The DNA is collected by centrifugation and next the pellet is washed once with 
75 % ethanol and subsequently it is dissolved in TE buffer and stored at -20* C. 

In addition, from a pGW1900 Kpn I digest a 2.7 kbp Kpn 1 DNA fragment is electrophoretically isolated 

70 from a slice of agarose gel and is Inserted into pEMBL18 (Dente & Cortese, ref. 10) resulting in plasmids 
pGW1902 (see Figure 4) and pGWl903, respectively. 

In order to construct a physical map of pGW1900 and pGW1902 several reaction mixtures are prepared 
that contain approx. 1 ug of piasmid DNA, the relevant buffer as recommended by BRL and 10 units of a 
restriction enzyme or a combination of two different restriction enzymes. The reaction products are 

75 analyzed on 1% agarose gels in TAE buffer. From the calculated size of the specific fragments the position 
. of the individual restriction enzyme sites in Figure 3 and 4, respectively, is established. 



Example 3.10: Molecular cloning of the polygalacturonase II gene (pgall) of A.niger NW756 

20 

The genomic DNA of A. niger NW756 is analyzed by Southern blot analysis using the pgall gene of A. 
niger N400 (i.e. the 1.2 kbp"BamHI-Bglll fragment of pGWISOO) as a probe. With respect to what has been 
described in Example 7.1. two modifications are made. In this case restriction enzymes are chosen which 
cut within the structural moiety of the A.' niger N400 pgall gene and the hybridization condition?; are more 

25 stringent, i.e. the ''homologous" conditions described in Example 7.2. In the BamHi digest now a single 
hybridizing fragment of 3.3 kbp is detected. Under these hybridization conditions a single sequence is 
detected, which sequence is therefore defined as the pgall gene of A. niger NW756. A single hybridizing 
Hindi fragment of 3.3 kbp is observed, which indicates the absence of a Hindi site in the structural gene. 
The hybridizing Xhol-Bglll fragment is 5.5 kbp. These results are in disagreement with the data given in 

30 Example 3.7 and the DNA sequence of pgall of A. niger N400 depicted in the sequence listing under SEQ 
ID No. 2. Therefore, it is concluded that the pgall gene of A. niger NW756 is not identical to the pgall gene 
of A. niger N400. 

A genomic library of A. niger NW 756 is constructed as described for strain N400 in Exarrple 2 with 
minor modifications. $au3AI is used to restrict the Aspergillus DNA, instead of its isoschizomer Mbol. The 

35 library of strain NW756 is constructed in the lambda vector EMBL3, which is a close relative of Er.'lBL4 (ref. 
9). EMBL3 DNA, already digested with BamHI and EcoRI and phosphatase treated, has been purchased 
from Promega. Part of the resulting library is plated and plaque lifts are prepared as described in Example 
3.3. The filters are then hybridized with the 1.2 kbp BamHI-EcoRI fragment of pGW1803. using the 
homologous conditions as described in Example 7.2. Positive phages are purified in a rescreening step and 

40 subsequently the DNA of these phages is purified as described in Example 3.4. The DNA is subjected to 
restriction analysis, using the enzymes Bglll and Xhol. Next, the 5.5. kbp Xhol-Bglll fragment of cne of the 
positive phages that contain this fragment is isolated as described in Example 3.6. This fragment is then 
ligated into BamHI and Sail digested pEMBLIS and the resulting ligation mixture is used to transform E. coli 
JM109 (Example 3.6). The transformants are then analyzed by colony hybridization under "heterologous" 

45 conditions as described in Example 7.3. The piasmid DNA of a positive clone is purified and it is 
subsequently used to construct a physical map (Example 3.7). 

The restriction map of pGW1756 is shown in Fig. 6. pGW1756 is the 5.5 kbp Xhol-Bglll fragment which 
contains the A. niger NW756 pgall gene inserted into the vector pEMBL18. 

50 

Example 4: The determination of the nucleotide sequence of the polygalacturonase genes 



Example 4.1: The polygalacturonase II gene (pgail) of A. niger N400 

Suitable restriction fragments of pGWISOO and pGW1803 are isolated after agarose gel electrophoresis 
and ligated into M13mp18RF and (VI13mp19RF vectors, using a 2-10 fold excess of fragment with respect 
to the vector. Transformation of E. coli JM109 is carried out as described hereinbefore whereas after the 

39 



BNSOOCID: <EP 0421919A2> 



EP 0 421 919 A2 



heat shock the cells are immediately plated as described in the BRL M13 cloning/dideoxy sequencing 
manual (p 34; ref. 11). Isolation of single-stranded DNA templates from recombinant phages is performed as 
described by Ausubel et al . (section 7.3.9; ref. 40) whereas the ceils are harvested in addition to the 
supernatants and these cells are subsequently stored as a glycerol culture at -20* C. 

5 One of the clones thus obtained, which is the 2.4 kbp Xbal-EcoRI fragment of pGW1803 inserted into 

the poiylinker of M13mp18, is subsequently manipulated in order to'be able to obtain. sequencing data from- 
the Hindlll, PstI, BamHI and Ncol sites respectively. Separate overnight cultures in YT medium are prepared 
from E coli JM109 and from the relevant clone, in the latter case using part of the glycerol culture as an 
inoculum. Then 200 ml of 2xYT medium is inoculated with 0.4 ml of the JM109 culture and this culture is 

10 subsequently incubated for 2.5 h in an orbital shaker at 37' C. Then 20 ml of the overnight culture of the 
relevant clone is added and the incubation in the orbital shaker is continued for 5 h. Replicative form DNA is 
then isolated from these cells as described for pGW1803 hereinbefore. This DNA is then digested with 
Hindlll, Pstl. BamHI and Ncol, respectively. The BamHI and Ncol digested DNA is then digested with Sail, 
subsequently e>ctracted with phenol/chloroform (1:1) and chloroform and is then ethanol precipitated. The 

75 non- compatible sticky ends are then filled in using 5 units of T4 DNA polymerase (BRL) in the buffer as 
described by Maniatis et al . (p 117; ref. 6) and in the presence of 0.1 mM each of dCTP, dATP, dGTP and 
dTTP. The reaction mixtures with a volume of 25 u( are incubated for 5 min at 37* C and then 5 ul of 0.5 M 
EDTA pH 8.0 is added' and the mixtures are subsequently phenol extracted. The small DNA fragments 
present in the four DNA preparations thus obtained, are removed by electrophoresis in a 0.7% iow melting 

20 point agarose gel (BRL) and gel slices that contain the large fragmervts are isolated. These DNA fragments 
are subsequently circularized using T4 DNA Jigase. The resulting ligation reaction mixtures are subse- 
quently used to transform E. coli JM109 as described above. 

Since Bell is sensitive to methylation of the substrate it can not cut the plasmid DNA isolated from 
E.coli JM109 or E. coii DHSap'. Therefore, the previously isolated pGW1800 and pGW1803 are used to 

25 transform E. coli JM110 (Yanisch-Perron et al., ref. 29) and the plasmid DNA is subsequently isolated as 
de5 :::-ibed in Example 3.8, except for the addition of 0.1% casamino acids to all media used to grow E. coli 
Jfv: 0. The plasmid DNA isolated from E. coli JM110 is then used to obtain subclones which~ailow 
se- i.nce analysts from the Bell site. 

■ he resulting clones are sequenced with the T7sequencingTM Kit from Pharmacia, using the conditions 

30 re' c .mended by the supplier. In some cases the sequencing data thus obtained are used to program the 
s> -r-iesis of specific oligonucleotides according to the method of Caruthers (ref. 8). These oligonucleotides 
are HR6425, which is 

5^ (d)CAAGAACGTCAGCATCGAAG3'. HR6439, which is 
5/d)GAATTGCTCACGGTGGAGTG3' and HR6440, which is 

35 5 (d)ACTTGGGCTTCTTC ( 1 I CCG3 . These oligonucleotides are subsequently used as specific sequencing 
primers for the relevant templates. The primer HR6439 gives two superimposed sequencing ladders when 
used on M13 templates and therefore this primer is used for double strand sequencing of alkali denatured 
pGWISQO, following the instructions of Pharmacia. 

The sequence of the gene encoding PGIl ( pga II) located on pGW1800 is obtained from both strands of 

40 DNA. Sequencing in the downstream direction ~(fr^m the Xbal site at position 1 in the direction of the Pvull 
site at about position 3028) is performed from the Xbal(1), EcoRV(about 334). Hindlll(about 557), Pstl (about 
827), BclI(about 1056), BamHI(about 1158). Hindi (about 1344 and about 1974), Kpnt (about 1565 and about 
2706), Ncol(about 1749) and the Bglll(about 2379) sites, respectively, whereas the primer HR6425 is used 
to sequence the region of the Bglll site. Sequencing in the opposite direction is performed from the Pvuii 

45 (about 3028 and about 1442), Kpnl (about 2706 and about 1565). Bglll(about 2379). Hincll(about 1974). 
BamHI(about 1158). Bcll(about 1056), Pstl(about 827), HindIII(about 557) and EcoRV(about 334) sites, 
respectively, whereas the primers HR6439 and HR6440 are used to sequence over the Hincll(about 1974) 
and Kpnl (about 1565) sites, respectively. 

The sequence of pga II is given in the sequence listing under SEQ ID NO. 2. The 3031 bp sequence 

50 starts with the first nucleotide of the Xbal site at position 1 in pGWISOO and ends at the last nucleotide of 
the Pvull site indicated at the approximate map position 3050 in the restriction map of pGW1800. The pga II 
gene comprises 1356 nucleotides of the promoter region, 1 138 nucleotides of the structural part (including a 
putative intron of 52 nucleotides) and 537 nucleotides of the transcriptional terminator region. 

The sequence immediately downstream from the signal sequence and the Xhol cleavage site encodes 

55 an amino acid sequence which is in complete agreement with the sequence obtained for the 5 kDa 
fragment of polygalacturonase II under the prediction that cysteine will be present at position 3 (Example 
1.3). 

The DNA sequence encodes a leader peptide of 27 amino acids, arginine being the last amino acid 

40 
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before the mature protein sequence. This leader peptide must be removed by proteolytic cleavage. Since 
signal peptidase cleavage sites are not found immediately after arginlne residues and not immediately after 
charged amino acids in general (von Heijne; ref. 39). the leader- peptide is removed In at least two 
proteolytic steps, i.e. the leader peptide represents a prepro-sequence. In this case, signal peptidase 
5 cleaves the pre-sequence, or signal-peptide. whereas the remaining pro-sequence is cleaved by yet another 
protease. 

The polygalacturonase It structural gene contains one intron, rnost probably comprising the nucleotide 
sequence from position 1987 to 2038. This sequence contains termination codons in all three possible 
reading frames. The presence of this intron changes the reading frame, in a manner which is consistent with 

10 the protein sequencing data obtained (see Example 1). The reading frame before the intron is confirmed by 
the amino acid sequences of the cyanogen bromide fragments (Example 1.3), whereas the reading frame 
after the intron is confirmed by the amino acid sequence of the tryptic peptide TP4 (Example 1.5), which 
sequence can also be deduced from the nucleotide sequence from position 2183 to 2225. The 5 splice site 
of the intron, GTAAGC. resembles the fungal 5 splice consensus GTPuNGT, whereas the 3' slice site TAG 

75 is in complete agreement with the fungal consensus 3' splice site PyAG (ref. 41), 



Example 4.2: The polygalacturonase I (pgal) gene of A niger N400 



20 Suitable restriction fragments are isolated from pGW1900 and pGW1902 after agarose gel elec- 
trophoresis as described in Example 3.9. These are ligated into Ml 3mp 18RF and M13mp19RF vectors 
according to the BRL Ml 3 cloning/dideoxy sequencing manual (ref. 11). Competent cells are made as 
described in the Pharmacia Manual for the Ml 3 cioning/sequencing system. Transformation of E. coli 
JM101 is carried out as described by Messing et al., ref. 30. Isolation of single-stranded DNA tenriplales 

25 from recombinant phages follows conventional methods (e.g. ref. 11, pp. 29-34). The resulting clones are 
sequenced using the T7 sequencingTM kit from Pharmacia (Uppsala, Sweden) using the conditions 
recommended by the supplier. 

pGW1900 is sequenced from the Eco Rl site at its approximate map position 5750 until the Cla I site at 
its approximate map position 3900 whicFTis located close to the Hind Ml site at about position 3790. Several 

30 oligonucleotides (304-307) are synthesized using the method of Caruthers (ref. 8). These oligonucleotides 
have the following sequences: 

Number 304 5'(d)TTCAGCCCAAGCGTCAATCC3' 
Number 305 5^(d)ACCTGAACGACTTCACCATC3' 
Number 306 5'(d)TCTGTAGQACQTCTGGTTG3' 
35 Number 307 s' (d) T G G G A G T A A A A C G A C C T A A C 3' 

They are used as specific sequencing primers for the relevant templates. 

The oligonucleotide number 307 is used for double strand sequencing of alkali denatured pGW1900 
following the supplier's instructions for the T7 sequencing™ kit. The entire sequence of the pga I gene is 
given in the sequence listing under the SEQ ID NO. 1, The sequence is based on the sequencing data 

^0 obtained with both strands. 

The 2495 bp pga I sequence starts with the first nucleotide of the Eco R! site at the approximate map 
position 6280 in pGW1900 and ends 12 nucleotides beyond the last nucleotide of the Cla I site close to the 
Hind III site indicated at 3880. The pga I sequence comprises 909 nucleotides of the pTomoter region. 1218 
nucleotides of the structural part, including two putative introns of 52 bp (intron A) and 62 bp (intron B), and 

45 3 66 nucleotides of the transcriptional terminator region. 

The amino acid sequences obtained for the mature polygalacturonase I (see Example 1.4) and for the 
21 kDa cyanogen bromide fragment (Example 1.5) are in complete agreement with an amino acid sequence 
which can be derived from the nucleotide sequence and which starts at position 1003. The DNA sequence 
thus encodes a leader peptide of 31 amino acids, lysine being the last amino acid before the mature protein 

50 starts. For similar reasons as given in the case of PGII where the leader peptide ends with an arginine 
residue, this PGI leader peptide represents a prepro-sequence which is also removed in at least two 
proteolytic steps. The sequence which hybridizes with the oligonucleotide mixture used as probe and starts 
at position 1277 of the nucleotide sequence. There is complete agreement between the N-terminal amino 
acid sequence determined for the 5.5 kDa cyanogen bromide peptide fragment as described in Example 1 

55 and the putative amino acid sequence based on the nucleotide sequence. 

The polygalacturonase I structural gene contains two introns. Intron A comprises the nucleotide 
sequence from position 1138 to 1189. The 5' splice site G T A T G T is in agreement with the fungal 
5'splice site consensus sequence GT Pu NGT (ref. 41) whereas the lariat sequence GC TAAC and the 3' 
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splice site TAG also agree with the known consensus sequences Pu CT Pu AC and Py AG. The presence 
of this intron changes the reading franne in a manner that is consistent with the protein sequence data 
obtained with the more distally located 5.5 kDa cyanogen bromide fragment (see Example 1). The second 
intron (B), comprises the nucleotide sequence 1610 up to 1671. Both the lariat sequence and the 3' splice 
5 Site agree with the known consensus sequences. The 5, splice site G 0 A 0 G A does not agree with the 
consensus sequence GT Pu NGT, but GC at the 5 splice site as well as an A at position +6 with respect 
to the 5 splice site have been found in some other genes (refs 41 and 43), 
The presence of intron B is based on the following arguments: 
a) It changes the reading frame whereas in the other two reading frames premature stops occur. 
70 b) The intron in pga II occurs at the same position and the amino acid sequence preceding this intron 
Asn - Ser - Gly - GIu is identical in both proteins. A region of strong homology is also found between the 
amino acid sequences deduced from the nucleotide sequences immediately following the putative intron: 
Amino acid residue 

PG II Asn - He - Trp - Phe - Thr Gly - Gly - Thr - Cys 
75 PG I Ser - lie - Ser - Phe - Thr Gly - Gly - Thr - Cys 
PG II lie - Gly - Gly - His - Gly - Leu - Ser - He -Gly 
PG I Ser - Gly - Gly - His - Gly - Leu - Ser -He - Gly 



20 Example 4.3: Homology between A. niger N400 polygalacturonases 



The polygalacturonases PGl and PGII isolated from A. niger catalyze the same reaction and are 
immunologically related to each other, as described in ExampTe 1 .2. 

The genes coding for these enzymes are also closely related since with the pga II gene other 
25 polygalacturonase genes are isolated from a genomic A.niger N400 library as described in Example 7.2. A 
sequence comparison of the putative PG I and PG II amino acid sequences of the mature enzymes which 
differ 2 residues in length reveals a large degree of homology. These data clearly indicate that genes 
coding for PGs are closely related and represent members of a polygalacturonase gene family. 

30 

Example 5^ Cotransformation of A. niger using the A. niger pyr A gene as selection marker and a plasmid 
carrying the polygalacturonase 1 or H gene as coTransforming^plas'mid ' 



35 Example 5.1: Propagation and purification of plasmids used to transform A. niger 



Plasmid pGW635 which is a shorter version of pGW613 (Goosen et al. ref. 13) and which also contains 
the pyr A gene, and plasmid pGW1800, which contains the polygalacturoTTase II gene, are propagated in E. 
coli MH1 and JM109, respectively. Plasmid pGW1900 is propagated in E. coli DHSap'. Plasmid DNA "is 
40 recovered from 250 ml overnight cultures as described in Maniatis et al. (pp. 90-91; ref. 6) and in Example 
3.7. 



Example 5.2: Preparation of protoplasts and transformation of the uridine auxotrophic mutant A, niger strain 
45 N593 ' : 

The A^ niger strain N593 (csp A, pyr A) which is a derivative of the parental strain N400 has been 
obtained by positive selection against the toxic analogue 5-fluoro-orotic acid like in yeast (Boeke et al. , ref. 
14) in the presence of uridine as described by Goosen et al. (ref. 13). 

50 Liquid minimal medium supplemented with 0.5 % ye^t extract. 0.2% casamino-acids, 10 mM uridine 
(Janssen Chemie) and 50 mM glucose is inoculated with 10^ conidiospores of A. riiger N593 per ml and 
incubated for 20 h at 30* C in a New Brunswick orbital shaker. Mycelium is harvested by filtration, through 
Miracloth, washed with iso-osmotic minimal medium (STC) of the following composition: 1.33 M sorbitol, 10 
mM Tris-HCI pH 7.5. 50 mM CaCla. An amount of 1 g of mycelium is resuspended in 20 ml STC. 

55 Protoplasts are released from the mycelium within 2 h by adding 150 mg of filter-sterilized Novozym 234 
(Novo Industries, Denmark) and incubating the mixture at 30 ' C in a shaker at 95 rpm. The protoplasts are 
separated from residual mycelium by filtration using a funnel with a plug of glasswool. Then cold STC is 
added to a volume of 40 ml and the mixture is placed on ice for 10 min. The protoplasts are recovered by 
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centrifugation (10 min, 2500 rpm) and the pellet is resuspended in 5 ml of STC. The protoplasts are 

pelleted once more and finally resuspended in 1 ml of cold STC. 

For transformation 5x10^ protoplasts are taken in 200 ul\ which are then incubated together with 1 ug 

pGW635 and 20 ug of pGW1800 or pGW1900. After the addition of plasmid DNA to the protoplasts 50 ui 
5 PCT (10 mM Tris-HCI pH 7.5, 50 mM CaCla. 25 % PEG6000) is added and the incubation mixture is kept 

on Ice for 20 min. Then another 2 ml PCT Is added and the mixture is incubated for a further 5 min at room 

temperature. Finally, 4 ml STC are added and mixed. Aliquots of 1 ml of this final transformation solution 

are mixed with 4 ml liquified iso-osmotic MM top-agar stabilized by 0.95 M sucrose. The protoplast mixture 

is immediately plated on agar plates containing the same iso-osmotic minimal medium and these are 
70 incubated at 30 C. Appropriate control experiments include protoplasts treated similarly without plasmid 

DNA and on non-stabilized minimal medium with 2.5 mM uridine. 

After 3 days of growth at 30 'C well growing transformants appear which sporuiate (150 

transform ants/ug pGW635). Besides, a large number of presumably abortive transformants appear. Approx. 

300 transformants are obtained with pGW1800 and approx. 100 with pGWISOO. Of each of the transfor- 
75 mants with pGWISOO and pGW 1900 twenty colonies are picked at random, spores of individual 

transformants are taken and plated out separately on 50 mM glucose minimal medium to obtain single 

colonies which are purified once more and subsequently used to propagate spores for further transformant 

analysis. 

20 

Example 5.3: Selection of polyg alacturonase overproducing co-transformants of A. niger strain N593 by 
halo-formation on pectin containing solid media 



Approximately 200 colonies transformed with pGW1800 and approximately 100 transformed with 
25 pGW1900 obtained as described in Example 5.2 have been screened for an increase in polygalacturonase 
activity by a colony screening procedure without preceding purification. The medium used for screening is 
composed out of 2 % (w/v) glucose. 0.5 % apple pectin (34.8 % degree of esterification. Obipektin, 
Bischoffszel!) minimal medium salts and spore elements; 6 g/l NaNOa; 0.2 % yeast extract; 0.2 % peptone. 
0.004 % Triton X-100 and 1.2 % agar. The Petri dish is inoculated in the centre and incubated for 2 days at 
30 30 C. The colonies are stored overnight in the cold. Then the surface of the plate is stained by adding an 
overlay of 5 ml of a 0.05 % ruthenium red solution, which is incubated for 5 min while shaking. The 
unbound dye is then removed by washing with distilled water for another 5 min. The production of 
polygalacturonase under these conditions is indicated by the size of the halo formed. Approximately 50 % 
of the transformants thus analyzed has a polygalacturonase activity which is higher than the wild type. 
35 Twenty colonies of pGWISOO transformants and 7 of pGW1900 transformants are picked on the basis 
of the size of the halo formed on pectin containing solid media and after a second screening with these 
positive colonies after 45 h of grov/th, 6 of each of the transformants are finally picked and purified as 
described in Example 5,2. 

40 

Example 5.4: Genomic analysis of polygalacturonase II transformed A. niger strains 



Transformants obtained according to Example 5.2 or 5.3 as well as the parental wild type strain N402 
are cultured on liquid minimal medium supplemented with 0.5 % yeast extract. 0.2 % casamino acids, 50 

45 mM glucose and NaNOa (6 g/I). After 18 h of growth at 30 'C the DNA is extracted and analyzed for the 
presence of co-transforming plasmid. A. niger DNA is isolated as already described hereinbefore. 

Digests of the chromosomal DNA (2 ug) are made at 37* C in a reaction volume of 200 u\ consisting of 
50 mM Tris-HCI pH 8.0, 10 mM MgCb. 100 mM NaCt and 4 mM spermidine which has been adjusted to 
pH 7.5 with Tris base before its addition to the reaction mixture. 20 U of both EcoRI and Sail are added and 

50 the reaction mixture is incubated for 2 h at 37*C. Then another 20 U of both enzymes is added and the 
incubation is continued for another 2 h. The reaction mixture (200 ul) is subsequently extracted with 100 ul 
chloroform. The DNA is subsequently precipitated from the aqueous phase by the addition of 1/10 (v/v) of a 
3M sodium acetate buffer pH 5.2 and 2,5 vol of ethanol. After incubation at 4* C for 1 h and after 
centrifugation, the DNA pellet is air-dried and dissolved in 20 ul sample buffer. DNA of the transformants 

55 'and of A. niger N402, digested by EcoRt/Sall. is analyzed by hybridization of Southern blots as described 
hereinbefore with the 1.2 kbp BamHI/Bglll fragment of pGW1800 as PGII probe. 

The cotransformation frequency is found to be over 75 %. A number of 9 transformants has been 
analyzed by Southern blotting. The probe hybridizes in N402, with a large genomic fragment. In the 
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genomic blots of the transformants which contain pGW1800 sequences, a 4.1 kbp EcoRI-Sal insert is found. 
Depending on the copy number the intensity of this band varies. In the cases analyzed the genomic 
fragment which represent the wild type gene, is always found, indicating heterologous integration. The 
analysis of the polygalacturonase II production in A. niger transformant named hereinafter N593/pGW1 800- 
5 27 is given as a detailed example (Example 6). From a DNA dilution series, the copy number is estimated 
to be at least in the order of 20, Besides the wild type hybridizing fragment and the 4,1 kbp fragment a few 
other minor hybridizing fragments are found in this strain which represent border fragments of type II 
integration events or rearrangements. 

10 

Exampel 5.5: Polygalacturonase II production by transformed A. niger strains and by A. niger N402 

Twenty of the pGWIBOO transformants described in Example 5.2 and six of the pGWIBOO transformants 
described in Example 5.3 are grown on a medium which consists of a minimal salt medium to which urea 

75 (4.2 g/l), 1 % (w/v) apple pectin (d.e. 61.2 %) and 1 % (w/v) wheat bran is added. The cultures are grown 
for 43 h at 30* C using a Gallenkamp orbital shaker at 250 rpm and culture filtrates are obtained. The 
mycelium is removed by filtration over Miracloth using a Buchner funnel. 

The PGM content of these samples is examined by Western blotting, which is performed using the 
alkaline phosphatase detection method according to the Biorad instruction manual.. On the basis of the 

20 signals on the Western blots, amongst the 20 transformants randomly obtained in Example 5.2, 70 % 
produces significantly more polygalacturonase II than A. niger N402. The transformants selected on the 
basis of the halo-size using the plate-screening method (see Example 5.3) all produce significantly more 
enzyme than A. niger N402. 

The A. niger strain N402, and three of the transformants, named N593/pGW1 800-27, N593/pGW1 800- 

25 30 and N593/pGWl 800-37, are selected for measuring polygalacturonase activity. The latter three strains 
are from the set of pGW1800 transformants obtained in Example 5.2. 

N593/pGWl 800-30 and N593/pGW1 800-37 both contain multiple copies of pGWISOO, but less copies 
than N593/pGW1 800-27. The strains are grown on a 1 % pectin medium (d.e. 61.2 %) to which 1 % (w/v) 
dried and milled sugar beet pulp is added, using the conditions as described above. In order to remove 

30 reducing sugars and inhibitors of PG activity. PGII is partially purified after the fermentation using cross- 
linked alginate. 1 ml of culture filtrate is diluted with 1 ml 20 mM sodium acetate buffer pH 4.2 and is then 
loaded on a small column containing a bedvolume of 2 mi cross-linked alginate (5.2 ml/g) equilibrated in 
this buffer. The column is subsequently washed with 4 ml of the sodium acetate buffer and next the PGII 
activity is pulsed from the column, using 4 ml of 20 mM sodium acetate buffer pH 4.2 to which 1 M NaCI 

35 has been added. The PG activity in the eluate is then determined as described (ref. 2) and from the value 
obtained the PG activity in the culture filtrate is calculated. 

The following PG activities have been calculated for the culture filtrates obtained 20 h after inoculation: 
2.2. 5.6, 5.8 and 10.8 units/ml for the strains N402, N593/pGW1 800-30, N593/pGW1 800-37 and 
N593/pGW1 800-27. respectively. For the culture filtrates obtained after 40 h these- values are, in the same 

40 order, 2.8. 13.7, 16.4 and 30.9 units/ml. respectively. 

It thus appears that pGW1800 can be successfully used to transform A. niger to produce a much higher 
PG activity. 



45 Example 5.6: Polygalacturonase 1 production by transformed A. niger strains and by A. niger N402 and 
N593. ~ 

Seventeen of the pGW1900 transformants described in Example 5.2. and six of the pGW1900 
transformants described in Example 5.3. as well as A. niger N402 and a pyr* transformant of N593 are 
50 grown on a medium which consists of a minimal salt rnedium to which urea (4.2 g/l.), 1% (w/v) apple pectin 
(d.e. 61.2%) and 1% sugar beet pulp is added. The cultures are grown for 64 h at 30* C using a 
Gallenkamp orbital shaker at 250 rpm and in between samples of the culture filtrate are taken after 20 h and 
39 h. The PG I content of these samples is examined as the PGII content in Example 5.5 by Western 
blotting. 

55 On the basis of the signals on Western blots amongst the 23 transformants tested, 65% produces 
significantly more PG I than A. niger N402. The transformants selected on the basis of the halo-size all 
produce more PGI than A. niger N402 and four out of six transformants are high producers. Activity 
measurements indicate that in this medium the activity after 20 h is higher than after 39 h. Taking twelve 
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different PGI transformed A. niger strains the activity varies between 2.8 and 7 U/ml whereas the 
untransformed control strainnM402 and a A. niger N593/pGW635 transfornnant produce 0.5 and 0.8 U/ml 
respectively. This activity is at least the resTjit of wild type levels of both PG! and PGM whereas the increase 
in the transformants is the result of more PGI expression. 

5 

Example 6: Isolation, purification and characterization of polygalacturonase II from the PGII overproducing A. 
niger transformant N593/pGW1 80'0^7 

w 

Example 6.1: Culture conditions to prepare polygalacturonase 11 



The A. niger PGI! transformant N593/pGW1 800-27 described In Example 5.4, is grown on complete 
medium in" Petri-dishes for 3-4 days at 30*0 to produce conidla. The conldia are harvested in 5 ml sterile 

75 saline containing 0.005 % Tween 80 per plate. The spore suspension is agitated on a Griffin shaker for 20 
min., and after counting the spores, 10^ spores/ml are added to 1 I siliconized Erienmeyer flasks containing 
300 ml sterile growth medium. This medium is a minimal medium containing 70 mM ammonium chloride as 
the nitrogen source and 1 % (w/v) apple pectin (degree of esterification 61.2 %) and 1 % (w/v) sugar beet 
pulp as carbon sources. The mycelium is grown for 43 h at 30* C using a Gallenkamp orbital shaker at 200 

20 rpm. After growth the mycelium is removed by filtration through Miracloth using a Buchner funnel. The 
culture filtrate is adjusted to pH 4.2 using 1 N NaOH. 0,02 % sodium azide is added in all further steps to 
prevent microbial growth. 



25 Example 6.2: Purification of polygalacturonase II obtained from the A. niger transformant N593/pGW1 800-27 

The culture filtrate (approx. 2.5 I) is applied to a crosslinked alginate column (2.5 x 25 cm) equilibrated 
with 20 mM sodium acetate buffer pH 4.2. After loading the column is eiuted consecutively with one bed 
volume of the equilibration buffer, one bed volume of 20 mM sodium acetate buffer pH 5.6, a 1200 ml linear 

30 salt gradient (0-0.5 M NaCl) in the previous buffer, and finally 275 ml of a 1M NaCI solution in the same 
buffer. Fractions of 6 ml each are collected and tested for their enzymatic activity as described in Example 
1.1. The central part of the active fractions is pooled and dialyzed thrice against a 20 mM bis Tris HCI 
buffer pH 6.0. The final enzyme solution (475 ml) is then applied to a DEAE-Sepharose Fast Flow column 
(Pharmacia) equilibrated with the same buffer. After washing, a NaCI gradient is applied in the same buffer 

35 and polygalacturonase activity becomes eiuted. at approx. 100 mM sodium chloride. The active fractions are 
analyzed by SDS-polyacrylamide gel electrophoresis. These fractions which have a high PGII content are 
pooled (54 ml), dialyzed against 20 mM bis Tris HCI buffer pH 6.0 and further purified on a Mono Q column 
(Pharmacia) equilibrated in the same buffer. After loading, the enzyme is eiuted by applying a salt gradient 
(0-1 M NaCI). 

40 The enzyme is arbitrarily collected in two different pools A and B corresponding with the 0.2-0.26 M part of 
the sodium chloride gradient and with the 0,26-0.34 M part of gradient. Both fractions contain a single 
protein band upon SDS-polyacrylamide gel electrophoresis at the same position as PGII. 



45 Example 6.3: Amino acid sequence determination of the N-terminal part of polygalacturonase II 



200 ug of PGII from the combined pools A and B purified according to Example 6.2 is dialyzed three 
times against 1 I Millipore filtered distilled water and lyophilized. The amino acid sequence is determined as 
described in Example 1.3, using an Applied Biosystems model 470A gas phase protein sequencer. The 
50 following N-terminai amino acid sequence is determined for the enzyme: 



55 
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Position: 



1 



5 



Amino acid: asp - ser - X - thr - phe - thr - 



thr - 



ala - 



5 



Position : 



10 



15 



Amino acid: ala - ala - ala - lys - ala - gly 

Position: 20 

Amino acid: lys - X - ser - thr - ile 



- lys 



- ala - 



10 



The cysteine residues in the protein have not been modified and are thus not detected. They are likely to 
occur at position 3(X) and at position 18(X) of the sequence (see Exannple 1.3). The last three amino acid 
residues are only detected at low levels. The sequence obtained for this pure enzyme corresponds exactly 
with the amino acid sequence of the N-terminus of PGM deduced from the nucleotide sequence of the PGII 
gene (see Example 4 and Figure 4, formula II). The corresponding nucleotide sequence also shows 
cysteine residues at the position expected, namely at residues 3 and 18. This sequence also corresponds 
with the sequence determined for the 5 kDa cyanogen bromide fragment (see Example 1.3) which therefore 
corresponds with the N-terminus of the protein. 



Example 6.4: Properties of poiygalacturonase II purified from transformant N593/pGW1 800-27 



The enzyme purified according to Example 6.2 has been compared with the enzyme purified from 
Rapidase as described in Example 1.1. Both enzymes have an identical apparent molecular mass of 38 kDa 
on SDS-polyacrylamide gels and they react identically with polyclonal antibodies raised against the purified 
PGII. A monoclonal antibody which reacts only with a continuous epitope of PGII, and not with PGI. IMA, NIB 
or IV. also reacts with the enzyme purified according to Example 5.2, Upon isoelectric focusing PGII 
produced and purified according to Examples 6.1 and 6.2 shows a sharp band at a pl value identical to that 
of PGII (pl = 5.2). Due to the microheterogeneity of PCII, a number of other bands have been observed 
with lower pl values when using a pH gradient from pH 3-7 or from pH 3-10. Similar patterns are obtained 
when using PGII produced according t Example 6.1, 



Example 7: Detection and isolation of sequences related to the polygalacturonase II gene 



Example 7.1: Detection of sequences related to the polygalacturonase II gene 



DNA is isolated from A^ niger NW756 using the procedure which is described above. The DNA is 
restricted with the enzymes BamHI, EcoRI and the combination of these enzymes as described in Example 
5,4. The DNA fragments are then separated on an 0.6 % agarose gel and they are subsequently transferred 
to a nitrocellulose membrane as described in Example 3.7. The baked membrane is prehybridized for 2 h at 
60 °C in prewarmed hybridization buffer, which consists of 1 % BSA (Boehringer), 1 mM EDTA, 0.5 M 
sodium phosphate pH 7.2 (added from a 1 M stock which is composed of 89.0 g of Na2HP04'2H20 and 4 
ml of 85 % H3PO4 per litre) and 7 % SDS, as described by Church and Gilbert (ref, 38). and to which 
sheared and denatured herring sperm DNA is freshly added to a concentration of 0.1 mg/ml. Then the nick- 
translated 1,2 kbp BamHI-Bglll fragment of pGW1800. which is prepared as described in Example 3.7, is 
added and the hybridization is allowed to proceed for 44 h at 6O' C while gently shaking. The membrane is 
then washed two times in prewarmed (60 'C) hybridization buffer (without herring sperm DNA) for 10 min. 
Subsequently the membrane is washed two times for 20 min at 60 ' C in a prewarmed buffer which consists 
of 5xSSC, 0.1 % SDS and 0.1 % Na+PaO/ 'lOHsO. The membrane is then dried and exposed to Kodak 
XAR5 film for 3 days at-60' C. 

Under these conditions unspecific hybridization is not observed, which follows from the absence of a 
significant smear of hybridizing DNA fragments and, in addition, from the absence of hybridization of the 
probe with the lambda size markers. In each lane several hybridizing bands are observed, whereas in each 
lane one band is much more intense than other bands. These intense bands corrrespond with a 3.3 kbp 
fragment in the BamHI digest, a fragment larger than 20 kbp in the EcoRI digest and a 3,3 kbp fragment In 
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the BamHI/EcoRI double digest. The bands with a lower intensity correspond with fragments of 17. 9.5, 6.4 
and 2.0 kbp in the BamHI digest, fragments of 15, 12, 6.5 and 4.8 kbp in the EcoRI digest and fragments of 
7.6, 6.4. 4.0, 3.7 and 2.0 kbp In the BamHi/EcoRI double digest. 

From the fact that specific hybridization signals are observed, It follows that the PGII gene of A. niger 

5 N400 can be used to detect specific sequences in the DNA obtained from A. niger NW756. The fragnnents 
which give strong hybridization signals are assumed to carry the PGII gene. In principle, a weakly 
hybridizing fragment can arise if a restriction enzyme cleaves the DNA within and close to the end of the 
region homologous to the probe used. However, this cannot fully explain the number of hybridizing large 
fragments which is observed in the present example. It follows that specific DNA sequences have been 

10 detected, which share homology with the PGII gene, but are not identical to it. These DNA sequences are 
assumed to be different PG genes, which is in agreement with the fact that PGs have homology at the 
protein level (see Example 1.4). 



75 Example 7.2: Isolation of genes related to the polygalacturonase II gene 



Approximately 1x10* phages from the A. niger N400 library are plated on 5 plates using E. coli LE392 
and three nitrocellulose replicas are made from each plate, as described In Example 3.4, whereas the third 
filter is incubated on top of a plate for 5 min. The first and the second filter of each plate are treated using 

20 conditions, which are referred to as "heterologous". The third filter is subjected to stringent conditions, 
which are referred to as "homologous". Prehybridization, hybridization and washing of the filters are 
performed at 60*0 under heterologous conditions and at 68° C under homologous conditions. After baking, 
the filters are wetted in 3xSSC and then prehybridized for two h in prewarmed hybridization buffer, which 
consists of lOx Denhardt's (see Example 3.4), 50 mM Tris-HCI pH 7.5, 20 mM EDTA pH 8.0. 1 M NaCI. 

25 0.5% SDS and 0.1 % sodium pyrophosphate, to which 0.1 mg/ml sheared and denatured herring sperm 
DNA is freshly added. The filters are then transferred, one at a time, to a flask containing 50 ml of 
hybridization buffer, including herring sperm DNA. to which the nick-translated 1.2 kbp BamHI-BglH 
fragment of pGW1800 has previously been added. The hybridization is allowed to proceed for 40 h and 
then the filters are washed, either using homologous or heterologous conditions. The homologous washing 

30 conditions are as follows: the filters are washed twice for 0.5 h in 2xSSC, 0.1 % SDS and 0.1 % sodium 
pyrophosphate and subsequently twice for 0.5 h in 0.2xSSC, 0.1 % SDS and 0.1 % sodium pyrophosphate. 
The heterologous conditions are as follows: the filters are washed twice for 0.5 h in hybridization buffer and 
then for 0.5 h in 4xSSC. 0.1 % SDS and 0.1 % SDS and 0.1 % sodium pyrophosphate and finally in 
2xSSC, 0.1 % SDS and 0.1 % sodium pyrophosphate. The filters are dried on the air and exposed to 

35 Kodak-XAR5 film for 3 days at -60 ' G, using intensifying screens. The phages that give positive signals are 
recovered as described in Example 3.4. 

On the filters subjected to the homologous conditions 7 positive signals have been obtained, which 
signals are also present at the corresponding position on the filters subjected to the heterologous 
conditions. These signals are regarded to result from recombinant lambda phages which carry the PGII 

40 gene. Except for these signals, 30 positive signals have been obtained on the filters subjected to the 
heterologous conditions and these signals are present at the corresponding position on both filters. The 
signal strength amongst these 30 signals varies. The majority of these signals is regarded to result from 
phages which contain a PG gene different from the PGII gene. 

The genomic A. niger N400 library is screened for the second time. Plaque lifts are prepared as 

45 described above. Except for the temperature used, the hybridization Is carried out as described above. 
Hybridization and washing Is at 62' C for one of the two filters obtained from a plate, whereas the other filter 
is hybridized and washed at 68' C. Following the hybridization, the filters are washed using high SSG 
(HSSC) conditions, either at 62'C or at 68*0. The HSSC conditions are as follows: the filters are washed 
two times for 5 minutes in 4 x SSG, then the filters are washed twice for 0,5 h in 4 x SSG and subsequently 

50 twice for 0.5 h in 2 x SSG, whereas the SSG solutions used also contain 0.1% SDS and 0,1% sodium 
pyrophosphate. 

On the filters incubated at 68' G ("homologous" hybridization condition) five strong positive signals 
have been obtained which signals are also present at the corresponding position on the filters incubated at 
62' G. Restriction analysis of one of these phages according to Example 3.5 indicates that this phage 
55 contains an insert on which the gene encoding PGII (pgall) is located. These five phages are therefore 
considered to all contain pga II. In addition, on the filters incubated at 62* G ("heterologous" hybridization 
condition) 17 positive signals are present, which are not or only weakly hybridizing on the corresponding 
filters incubated at 68' C . The signal strength amongst these 17 signals varies, 
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16 phages from, the first screening and 10 phages from the second screening, which phages do not 
contain pga II, are selected for further characterization. These phages are purified using the conditions used 
for their Initial detection. The phages characterized are the phages listed in Table VII and eight additional 
phages (numbers 2,3,7,31.33,40,41,42). The phages numbered 1 to 30 have been obtained in the first 
5 screening of the library, whereas the phages with higher numbers have been obtained in the second 
screening. 

In order to discriminate between those phages which contain the polygalacturonase 1 gene and those 
which contain other polygalacturonase II related sequences, plaque lifts of the selected phages are 
hybridized with the previously nick-transiated 1.8 kbp Hind III fragment of pGW1900 (Example 3.9.). 

10 Hybridization is at 62° C, whereas washing is also at 62 ' C while using a series of SSC buffers which 
buffers all contain 0.1% SDS and 0.1% sodium pyrophosphate (4 x SSC, 2 x SSC, 1 x SSC. 0.5 x SSC, 0.2 
X SSC and 0.1 x SSC; incubation Is for half an hour per buffer). Using these conditions, phages 2, 3, 7. 40, 
41 and 42 give strong hybridization signals, comparable to the signal obtained with phage PG 1-X7. which 
latter phage carries the pga I gene (Example 3.6:>. Restriction analysis of DNA obtained from one of these 

75 phages also indicates that this phage contains an insert on which the pga I gene is located. These six 
phages are therefore considered to all contain the pga I gene.The other phages analyzed (1, 8, 31. 33, 35, 
36. 37, 38, 43) give only a weak hybridization signaTTlf any. and these phages, therefore, do not contain the 
pga I gene. 

In order to find out which phages contain identical fragments. DNA is isolated from the phages indicated 
30 in Table VII as described in Example 3.5, and digested as described in Example 3.6 and the resulting 
fragments are subsequently characterized by Southern blot analysis. For the analysis of Hinf I and Hinc II 
digests, the agarose content of the gels is increased to 1% (w/v). Southern blots are hybridized at~60"C 
with the previously nick-translated 1.2 kbp Bam Hl/Eco RI fragment from pGW1803, whereas washing is 
also at 60 C, using the HSSC conditions described above. The restriction enzymes which are found to be 
25 useful to make an initial classification for these phages are a combination of Bam HI and Bgl II, and Hinf I 
and Hinc 11. which fatter enzymes are used separately. Hinc II and Hinf I usually generate"srhali restriction 
fragments, which, in the present case, minimizes the chance that the resulting hybridizing fragments contain 
DNA derived from the EMBL4 vector, in addition to the pga II related sequence. 
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Table VII: Classification of PG II related X phages based on the size of the hybridizing X 
DN A fragments (kbp) obtained after digestion of DNA isolated from these phages with 
Hind i, Hinf l and Bgin /Bam HI and sometimes with Sail using the 1.2 kbp BarnHI/EcoRl 
fragment of pGW1803 as a probe. 



Classes of X-phages^ 



Restriction 
enzyme 


A 


B D E F 
\ Phages belonging to the respective classes: 


G'' 




9,10,43 


4,35,36 


1,16.20,37 


8,11.12 


6.38 5 


17,27 






Fragment length (kbp) 






HincU 


• 1.25 


2.4 


1.75 


1.3; 0.4 


1.3;0.78 2.3 




Hinn 


0.7; 0.3 


1.1 


1.28 


1.8 


0.65 0.6 




Bglll/BamHI 


7.2; 0.56 
(X-IO) 


3.1 

(?t-4) 


8.7 

(k-20) 


1.7 


4.1 >23 
ik-6) 


3.0 a-17) 
4.9 (X-27 ) 


Sail 


6.9 


n.d. 


6.8 

(X.-16; k-20) 


1.3; 0.5 


n.d. 





^ Phages with number 16 and 20 aiso share a 1.64 Konl fragment and a 1.38 kbp 
Hinc II /Kpn l fragment. 

30 

^ Class G phages only weakly hybridize with PG II under non-siringent conditions, 
n.d, digestion not done, 

35 

From the data presented in Table Vil it is clear that for most phage classes several representative 
phages have been isolated independently. Given the fact that only a selection of all the hybridizing signals 
has been taken in the initial screening, the number of X-phages found for each class (class A; 3: B: 3; C: 4; 
D: 3; E: 2) indicates that different genes are found at a similar frequency. In the case of the PG I gene this 

40 number is higher (6 out of 26 phages analyzed) whereas PG II containing phages are found at the same 
frequency (12 out of 59 phages analyzed). The class G is more weakly hybridizing with both probes derived 
either from the PG 1 gene or from the PG II gene. Besides a hybridizing fragment of 3.0 and 4.9 kb in the 
phages with number 17 and 27, respectively, several non-hybridizing fragments which are identical in both 
phages and which are derived from the inserts are detected in the Bgl ll/Bam HI digest viz . a 5.6 kbp. a 5.1 

^5 kbp, a 1.45 kbp and a 0.57 kbp fragment. Also Hinc II and Hin ffdigests of both phages are almost the 
same. "~ 

The phages isolated in this Example are renamed as follows: 

50 
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5 



10 



15 



20 



Class: 


Number: 


Name: 


A 

A 


Ck 

y 






1 U 






A 1 




D 
D 


A 

^' 


\.Pfn-R<l 




OO 






OO 


\ pfi.R'5R 
r^r O DOD 




1 






1 ft 


\ pn.Pi ft 




on 


vpr; pon 




o/ 


\ p/^.poy 


D 


Q 






1 1 


\ pr5_r\i 1 

A.r^vj-U 1 1 




1 O 




c 


O 






38 


XPG-E38 


F 


5 


XPG-F5 


G 


17 


XPG-Q17 




27 


XPG-G27 


not determined 


31 


XPG-X31 


not determined 


33 


XPG-Y33 



Example 7.3: Subcloning of the PGC gene (pgaC): the construction of pGW1910 



Phage XPG-C20 (Example 7.2; phage 20, class C) is digested with Bglll, and the resulting fragments are 
separated on an agarose gel. The gel slices that contain the hybridizing 7.8 kbp Bglll are recovered and the 
fragment is then isolated as described (Example 3.6). The Bglll fragment is ligated into the BamHI digested 
and CIP (calf intestine phosphatase) treated pUC9. The ligation reaction mixture is used to transform E. coli 
JM109 (Example 3.6). Several of the resulting white colonies are streaked on YT agar supplemented with 
ampiciiiin. After growing overnight the colonies are adsorbed onto a nitrocellulose filter. The colonies on the 
filter are then lysed and the liberated DNA is fixed by baking as described by Maniatis et aL (ref. 6; p. 314). 
The baked filters are wetted in 2xSSC and they are gently rubbed with a gloved hand to remove the 
bacterial cell debris. The filters are then hybridized with the 1.2 kbp BamHI-EcoRI fragment of pGW1803, 
using the heterologous conditions described In Example 7.2 (60 " C, washes with 2xSSC). A positive clone is 
chosen and plasmid DNA is purified as described hereinbefore. The plasmid DNA is then used for 
restriction analysis to construct a physical map. 

In this way a new plasmid has been constructed. pGWl910 (Fig. 7) is the 7.8 kbp Bglll fragment of 
phage XPG-C20 inserted In the BamHI site of the vector pUC9. This subclone comprises hybridizing 
fragments of phage XPG-C20. e.g. the 1.1, kbp Sma! and 1.6 kbp Kpnl fragments. From the location of 
these fragments on the physical map of pGW1910 it is inferred that this plasmid contains the complete 
PGC gene (pga C). 

Example 8: Expression of human hybrid interferon BDBB under the control of the PGIl promoter 



Example 8.1 : Construction of plasmid pGil-IFN AM1 19 precursor (Figure 4) 

Plasmid pGW1800 is digested with EcoR! and treated with T^ polymerase as below. Reiigation of this 
DNA and transformation of E. coli DHSaF allows the isolation of a plasmid pGW1800-E, which is the same 
as pGWl800, except that the" EcoRI cleavage site is deleted. 

/-lasmid pGW1800-E is digested with Bglll and treated with bacterial alkaline phosphatase (BRL) in the 
presence of 50 mM Tris-HCI pH 8.0 and 50 mM NaCI for 1 h at 65*0. The alkaline phosphatase is 
inacrivated by digestion with proteinase K (Boehringer Mannheim) and phenol extracting. The DNA is 
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subsequently ethanol precipitated, dried and redissolved in water. Then the sticky ends are filled in with T+ 

DNA polymerase as below. 

Piasmid pJDB207-IFN AM119 (EP 205 404) is digested with Hindlli and Clal. The sticky ends of these 

linear fragnnents are filled in with T^ DNA polymerase (Boehringer Mannheim) in the presence of 0.1 mM 
5 each of dCTP, dGTP, dATP and^dTTP plus 67 mM Trls-HCI pH 7.5, 6.7 mM MgCl2, 16.7 mM (NHO2SO4 

and 5 mM DTT for 30 min at 37* C. The reaction is stopped by heating at 65' C for 5 min. The fragments 

are separated in a 0.8 % low gelling temperature agarose (BioRad) gel and the 1 kbp fragment, which 

comprises the IFN AM1 19 coding region, was excised and the DNA purified on an Elutip 0 (Schleicher & 

Schull) column, and ethanol precipitated (Schmitt & Lohen, ref. 34). 
10 100 ng of IFN AM119 fragment and the prepared pGW1800-E vector are ligated together in 5 M-I of 20 

mM Tris-HCI pH7.5, 10 mM MgCb, lOmM DTT, 1 mM ATP and 1 unit of T4 DNA ligase (Boehringer 

Mannheim) for 2 h at room temperature. This mixture is transformed into competent E. coli DH5aF' cells. 

Ampicillin resistant transformants are screened by restriction digestion of their plasrnid~DNA to identify 

those that carry the piasmid pGII-IFN AM1 19 precursor. 

15 

Example 8.2: The generation of precise pGll-lFN AM119 and pGllss-IFN AM1 19 fusions using PGR (Figures 
4and5) 

20 The PGR method followed is as described by R.M. Horton et al. (ref. 35) and is outlined in figure 5. 

pGWiaoO is linearised, by Xbal digestion, and precipitated with ethanol. After resuspension in water. 
100 ng of this DNA is subjected to amplification by polymerase chain reaction (PGR) using oligonucleotides 
A and B (Figure 5) in an automated thermal cycler for 25 cycles (each consisting of 1 min at 94* C. 2 min 
at 40' C, and 3 min at 72' C) followed by 10 min at 72° C. 100pM of each oligonucleotide and 0.5 ul of Taq 
25 polymerase (Perkin Elmer Cetus) are used for each reaction in a volume of 100 ul using the reaction buffer 
recommended by the supplier. This gives DNA 1 (Figure 5). 

Similarly treated pGWIBOO with oligonucleotide A and G gives DNA 2. 

Similarly pJDB207-fFN AM1 19 linearised with BamHl and subjected to PGR with either oligonucleotides 
D and F or oligonucleotides E and F gives DNA 3 or DNA 4 respectively. 
30 These reaction mixtures are precipitated with ethanol. redissolved in water and an aliquot is checked on 
a gel to determine the concentration of the DNA fragments in the mixtures. 

Using the same conditions as above DNA 1 and DNA 4 are subjected to PGR with oligonucleotides A 
and F to give DNA 5 and DNA 2 and DNA 3 with the same oligonucleotides to give DNA 6 (Figure 5). 

The DNA 5 starts with a BamH! site and ends with an EcoRI site and includes a perfect in frame fusion 
35 of the original starting methionine codon linked to a PGII promoter fragment with the coding region for 
mature hybrid interferon BDBB. 

The BamHI-EcoRI fragment of DNA 5 is ligated into BamHI-EcoRi cut piasmid pGll-IFN AM119 
precursor constructed in example 8.1 to create piasmid pGII-IFN AM119. 

Likewise the BamHI-EcoRt fragment of DNA 6, v/hich contains a perfect in frame fusion of the PGII 
40 signal sequence linked to a PGII promoter fragment with the coding region for mature hybrid interferon 
BDBB, was inserted in pGll-lFN AM1 19 precursor to generate piasmid pGllss-IFN AM1 19. 

Example 8.3: Gotransformation of Aspergillus niger mutant An8 with pGG59D7 and pGllss-IFN or pGlMFN 
45 '■ ' ' ■ " ■ 

The uridine anxotrophic mutant An8 (= DSM 3917. disclosed in EP 278 355) is cotransformed with 
piasmid pCG59D7 and pGllss-IFN or pGII-IFN AM119 to yield uridine prototrophs. 

Conidiai spores of A. niger An8 are grown for 4 days at 28* C in complete medium until fully 
sporulated. 2x10^ conidiospores are used to inoculate 200 ml minimal medium supplemented 1 g/i arginine 
50 and uridine. 

After 20 hours growth at 28*G and 180 rpm. the mycelium is harvested by filtration through Miracloth, 
washed twice with 10 ml 0.8 M KGI, 50 mM CaCh and resuspended in 20 ml 0.8 M KGI, 50 mM GaCla, 0.5 
mg/ml Novozym 234 (Novo Industries). The mixture is incubated in a shaking waterbath (30* G, 50 rpm.) 
until sufficient protoplasts are released (detected microscopically after 90-120 min). The protoplast suspen- 
55 sion is filtered through a glass woo! plug in a funnel to remove mycelial debris. The protoplasts are pelleted 
by mild centrifugation (10 min, 2000 rpm) at room temperature and washed twice with 10 ml 0.8 M KGI, 50 
mM GaGl2. The protoplasts are finally resuspended in 200-500 ii\ 0.8 M KGI. 50 mM GaCIa to give a 
concentration of lx 10^ /ml. 

51 
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For transformation a 200 ui aliquot of the protoplast suspension is incubated with 5 ug of pCG59D7 and 
50 ug pGllss-IFN AM119 or pGII-IFN AM1 19 DNA. 50 m PCT (10 mM Tris-HCI pH 7.5, 50 mM CaCh, 25 % 
PEG 6000). The incubation mixture is kept on ice for 20 min. another 2 ml of PCT are added and the 
mixture incubated for further 5 min at room temperature. 4 ml 0.8 M KCI. 50 m(Vl CaCIa are added and 1 ml 
5 aiiquots of the final transformation solution are mixed with liquified minimal agar medium (Minimal medium 
+ 1 g/l arginine + 10 g/i Bacto-Agar (Difco)). stabilised with 0.8 M KCI. The mixtures are immediately 
poured on agar plates of the same medium and incubated at 30° C. 

After 2-3 days of growth at 28* C, stable transformants appear as vigorously growing and sporulating 
colonies on a background growth of many hundred small, presumably abortive, transformants. 



Example 8.4: Expression of the Hybrid-lnterferon BDBB gene under the control of the PGII promoter 



Transformants of the cotransformation experiment (Example 8.3.) are picked and analysed for interferon 
75 expression. Interferon activity is determined according to the procedure of Armstrong (J.A. Armstrong, Appl. 
Microbiol. 21_ , 732(1971)) using human CCL-23 cells and vesicular stomatitis virus (VSV) as the challenge 
virus. 

Conidial spores from transformants are individually precultured into 50 ml of a preculture medium 
(Pectin Slow Set L (Unipectin. SA, Redon. France) 3 g/l, NH*CI 2 g/i. KH2P0^ 0.5 g/l. NaCI 0.5 g/l. 
20 Mg2S04'7H20 0.5 g/i, Ca2S0<v'2H20 0.5 g/l, pH 7.0, 1 % arginine). The preculture is incubated for 72 
hours at 250 rpm and 28' C. 10 % of the preculture is used to inoculate 50 ml of main culture medium 
(Soybean fluor 20 g/l. pectin Slow Set 5 g/l. 1 % arginine). The culture is grown up for 72-96 hours at 250 
rpm and 28' C. 

At various times (every 20 hours) samples are taken, the cells are pelleted by centrtfugation and broken 
25 by freezedrying and dry grinding. Supernatant and ceil extracts are both tested for inteferon activity as 
described (supra). The bulk of the interferon activity is found secreted into the medium in transformants 
carrying pGllss-lFN AM119 while in transformants carrying pGlI-lFN AM1 19 it is mainly in the cell extract. 



30 Example 9: Polygalacturonase 1 and polygalacturonase 11 production by transformed A, niduians strains 



Example 9.1: Propagation and purification of piasmids used to transform A. niduians G191 E.coli MH1 



35 pGW635 is propagated in E. coli MH1, pGW1800 in JM109 and PGW1800 in DH5oF'. 



Example 9.2: Preparation of protoplasts and transformation of the uridine auxotrophic A. niduians mutant 



40 The A^ niduians mutant strain G191 (pyrG, pabaAl, fwAl, uaY9) has been described by Ballance and 
Turner. 1985 (ref. 27). 

Mycelium is grown at 37° C and further as described for A. niger (see Example 5.2.) except that 2 mg 
p-aminobenzoate is added per liter medium. Protoplasts are prepared following the procedure described for 
A niger . For transformation 5x10^ protoplasts are taken in 200 ul STC, which are then incubated together 
45 with 1 HQ pGW635 and 20 ug pGWISOO or 25 ug pGWISOO. Plates are incubated for 3 days at 37' C, 

Approx. 50 transformants/ug pGW635 are obtained in the cotransformation experiment, and in each 
experiment 20 transformants are further purified on 50 mM glucose minimal medium. These strains are 
subsequently used to propagate spores for further analysis. 

50 

Example 9.3: Western blot analysis of polygalacturonase I and II production by transformed A. niduians 



The cotransformants obtained according to Example 5.2. using pGWISOO as the cotransforming 
55 plasmid are grown submerged in 75 ml minimal medium using 2 mg/1 p-aminobenzoate, 7.5 g/l NH4.NO3 as 
nitrogen source and 1% (w/v) apple pectin (d.e. 61.2 %) plus 1% (w/v) sugar beet pulp as carbon source 
starting with 10^ conidiospores per ml. The Gallenkamp orbital shaker is used at 250 rpm and the growth 
temperature is maintained at 30* C. 
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Samples are taken after 43 and 68 h of growth, centrifuged and the supernatant is dialyzed overnight at 
4'C against 5 mM sodium phosphate buffer pH 6.5 containing 0.02 % (w/v) sodium azide. Analysis of the 
PG II content of these samples is examined by Western blotting (see page 69) using polyclonal or 
monoclonal antibodies. The A. niduians samples used as control do not contain PG II cross-reactive 

5 material as tested with monoclonal antibodies. 

The twenty transformants analyzed all produce polygalacturonase II although the amount varies 
amongst the different transformants. Besides a major band of the expected molecular weight (38 kDa) 
several minor bands of lower molecular weight, most likely representing degradation products, are 
observed. The transformant which produces the most enzyme is A. niduians G191/pGW1800-13. The 

10 polygalacturonase activity of this transformant using different growth media was compared with the activity 
of the recipient strain as shown in Table VIII. 



75 



Table VTII: Polygalacturonase acrivides of culture filtrates of the A. niduians transformant 
G191/pG\V18G0-13 and of A.Anidulans GI91. The strains are grown by inoculating 
10^ spores/ml at 30°C in minimal medium using N- and C-sources as defmed and high 
phosphate (15 g KH2P04/liter) plus 0.1% yeast extract- 



so 



25 



30 



Strain 



G191 

G191/pGW635-I 
G191/pGW1800-13 



C-Source 



3 % glucose 
3 % glucose 
3 7o b-2lucose 



N-Source 



1 % NH4CI 
1 % NH4CI 
1 % NH4CI 



Fermen- Polygalac- 
tadon turonase 



time 



40 
40 
40 



G 19 1 1 % pecrin + I % sugar beet pulp 1 % NH4CI 40 

G191/pGWl 800-13 1 % pecdn + 1 % sugar beet pulp 1 7o NH4CI 40 



acnvity 
(U/ml) 

n.d. 
n.d. 
120 

n.d. 
130 



n.d. = not detectable 

35 

Since transformant G191/pGW1800-13 synthesizes high levels of PG 11 in the absence of pectic 
substances, the PGII production of the other A. niduians Gl9l/pGWl800 transformants obtained is also 
further analyzed. To this end 19 transformants~as well as Gl91/pGW635-1 are grown in a medium which 
consists of minimal medium salts, using 0.4% NH4CI as the nitrogen source and 5% glucose as the carbon 
source, with phosphate added at a high concentration (15 g/I KH2PO4). Culture filtrates are obtained after 46 
and 69 hrs and they are subsequently analyzed by Western blotting and probing with the monoclonal 
antibody, without prior concentration and dialysis of the culture filtrates. At least 17 out of the 19 A. niduians 
Gl91/pGWl800 transformants synthesize PG 1! on the glucose medium, whereas no signal is obtiined with 
-5 A. niduians G191/pGW635-1 . The fact that clear signals are obtained implies a high expression ievel in 
comparison with A. niger N402 grown under polygalacturonase inducing conditions such as described in 
Example 5.5, since only weak PG II signals are obtained on Western blots of culture filtrates of the non- 
transformed strain, when using nonconcentrated filtrates and the monoclonal antibody. The PG II production 
A. niger G191/pGW1800-13 and six other Gl91/pGW1800 transformants using the medium as described 
above, but with 1% beet pulp and 1% pectin as the carbon sources instead of 5% glucose, is also analyzed 
as described above. In contrast to A. niduians Gl91/pGW1800-13 these six transformants produce much 
more PG 11 on the medium with pectin and beet pulp, whereas again no signal is found with A. niduians 
Gl91/pGW635-1. 

The cotransformants obtained according to Example 9.2. using pGW1900 as the cotransforming 
55 -plasmid are grown on two different media. The first medium consists of a high phosphate minimal salt 
medium to which NH^Cl (4.0 g/I), 1% (w/v) apple pectin (d,e. 61.2%) and 1% (w/v) sugar beet pulp are 
added, whereas some transformants are also grown in this medium with urea (4.2 g/I) as the nitrogen 
source instead of NH^Ci. The difference with the previous low phosphate minimal medium (page 28) is that 
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15 g KH2PO4- per liter is used instead of 1.5 g. The second nnedium also consists of a high phosphate 
minimal salt medium to which 0.1% yeast extract. NH4CI (10 g/l) and 3% (w/v) glucose are added as 
nitrogen and carbon source. In both media 2 mg/1 p-aminobenzoate is added. The cultures are grown for 42 
h at 30" C using a Gallenkamp orbital shaker at 250 rpm and samples of the culture filtrate are taken after 

5 20 h and 42 h. In Western blots of A. nidulans G191, grown under similar conditions as the transformants, 
no or very little protein is detected~which cross-reacts with polyclonal antibody raised against purified 
polygalacturonase L In the culture filtrate of 75% of the transformants PG I is detected at significant levels. 

Using NH^GI instead of urea and high phosphate also in complex media, considerably reduces the 
proteolytic degradation of the polygalacturonase I which is produced. On the basis of Western blotting 

10 results and activity measurements transformant G191 pGWI 900-6 is estimated to produce over 1 g of 
enzyme per liter as indicated in Table VI since the specific activity of PG I is 550 U/mg (Kester and Visser, 
1990). 

Table VI: Polygalacturonase activities of culture filtrates of the A. nidulans transformant 

15 

G191/pG\V1900-6 and of the control strain G 19 l/pGW635-l. The strains are grown by 
inoculating 10^ spores/ml at SO^'C minimal medium using N- and C-sources as indicated 
and high phosphate (15 g KHjPO^.) 

20 

C-Source N-Source Fermen- Poly- 



25 


Strain 






tation 
time 


galac- 
turonase 
activity 
(U/ml) 




G19l/pGW635-l 


3 % glucose 


0.4 % NH4CI 


64 


n.d. 


30 


G191/pGW635-l 


1 % pectin + 1 % sugar beet pulp 


0.4 % NH4CI 


49 


n.d. 




G191/pGW635-l 


1 % pectin -r I % sugar beet pulp 


0.4 % urea 


26 


n.d. 




G691/pG\V 1900-6 


3 % glucose 


0.4 % NH4CI 


64 


5 


35 


G19I/pGW1900-6 


1 % pectin + 1 % sugar beet pulp 
1 % pectin + 1 % sugar beet pulp 
1 % pectin + I % sugar beet pulp 


0.4 % NH4CI 
0.4 % NH4CI 
0.4 % NH4CI 


22 
26 
49 


17 
125 
570 


40 


G19i/pGW190O-6 


I % pectin + 1 % sugar beet pulp 
1 % pectin + 1 % sugar beet pulp 
I % pectin + 1 % sugar beet pulp 


0.4 % urea 
0.4 % urea 
0.4 % urea 


22 
26 
49 


125 
225 
140 



n.d. = not detectable 

45 

The pga I and pga II gene coding for polygalacturonase I and II respectively are not expressed in 
A.niger on glucose "containing media, even in multicopy transformants with a high gene dose. The 
expression of these A. niger genes in A. nidulans transformants like in strains Gl9l/pGW1 900-6 and 

50 Gl91/pGW1900-10 and like in strains Gl'91/pGW1800-13 and Gl91/pGW1 800-20, respectively, indicates 
hov/ever that in A. nidulans catabolite repression of these genes is circumvented. This implies different 
specificities in the catabolite repression system amongst these different Aspergillus species. This can be 
exploited to express a particular A. niger polygalacturonase gene in a different host like A. nidulans under 
conditions which avoid expression of polygalacturonases of the host itself. The polygalacturonase enzymes 

55* thus obtained are practically pure. 



Example 10: Isolation, purification and characterization of polygalacturonase II from the A. nidulans transfor- 
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mant G191/pGW1800-13 



Exampie 10.1: Production of PG N using transformant Gl91/pGW1800-13. 

5 

The A. nidulans transfornnant G191/pGW1800-13 is grown in 100 ml cultures on a 1% sugar beet pulp 
and 1% pectin mediunn composed as described in Example 5.3 except that 15 g/l KH2PO* is used instead 
of 1.5 g/I. Similarly, this transformant is also grown both at low and high phosphate levels on minimal 
medium using 5% or 1% glucose as carbon source. These media do not lead to any detectable 
;o polygalacturonase II in A. niger N402 whereas the strain A. nidulans G191 itself is unable to make PQ II as 
described in Example 9^3. 

In samples of culture filtrates of the A. nidulans transformant G 19 1/pGW1800-13 taken after 72 h using 
5% glucose and high phosphate. approxT860 U/mL PG activity are found whereas using low phosphate this 
amount is 550 U/ml. The reduced levels observed when using 1% glucose and high phosphate are mainly 
75 due to an increase in protein turnover when the carbon source becomes exhausted after 48 h. In the case 
of 1% glucose and low phosphate, the production level remains low throughout the whole culture period. 
The amount of protein obtained per liter culture filtrate using 5% glucose and high phosphate approximates 
1 g per liter. SDS poiyacrylamide gel electrophoresis Indicates that the culture filtrate of the transformant 
contains one protein band with a molecular mass corresponding with that of A. niger PG If. 

2Q 

Example 10.2: Isolation and purification of PG II from culture filtrate of A, nidulans transformant 
G191/pG\/7T80b-lT ~ ~ 



25 The culture filtrate (approx. 2 I) which has a pH of approx. 4.9 is diluted 5-fold with distilled H2O and 

then taken to pH 6.0 using 2 N NaOH, To concentrate the enzyme, the solution is loaded on 600 ml 
Sephadex DEAE-A50 pre-equilibrated in 20 mM potassium phosphate buffer pH 6.0 which is contained in a 
Buchner funnel to allow high flow rates. All the enzyme activity is adsorbed to the ionic exchange material 
and collected In 300 ml by elution with 1 M NaCI in the same buffer (recovery: 65%). The enzyme solution 

30 is then dialyzed against 10 mM Bis Tris buffer pH 6.0 and loaded on a Sepharose DEAE Fast Flow column 
(60 ml bed volume) equilibrated in the same buffer. 

The enzyme activity is eluted by applying a 0-0.5 M NaCI gradient in buffer at approx. 0.1 5M NaCI (total 
recovery: 57%). 

35 

Exampie 10.3: Properties of PG H purified from the culture filtrate of the A. nidulans transformant A. nidulans 
Gl9l/pGW1800-13. 

PG II purified according to Example 10.2 has been compared with the enzyme purified from Rapidase 
40 as described in Example 1.1 (PG ll-case). Both enzymes have an identical apparent molecular mass of 38 
kDa on SDS-polyacrylamide gels and they react identically with polyclonal antibodies and with the 
monoclonal antibody (see Example 6.4). Upon isoelectric focusing PG II produced by the A. nidulans 
transformant and purified according to Example 10.2 has much less microheterogeneity than the enzyme 
produced by A. niger since besides the major band (pi 5.2) only one minor band is found. 
45 The kinetic" properties of both enzymes have been compared by measuring at 25" G in 0.075 M sodiurin 
acetate pH 4.2 polygalacturonase activity varying the polygalacturonate concentration between 0.2 and 1 
mg/ml using a modified ferricyanide test. For the A. niger PG II from Rapidase the V^ax is 2760 U/mg of 
protein with a Km of 0.8 mg/ml polygalacturonate "(USB). For the PG II derived from the transformant the 
Vmax value is slightly higher viz . 3480 U/mg of protein whereas the is identical. Finally PG II purified 
50 from the A. nidulans transformant has been treated with cyanogen bromide according to Example 1 .3 (PG 
ll-case). This leads to an identical pattern of fragments as observed for PG II purified from Rapidase. 

Example 1 1 : The expression of the pgall gene of A. niger NW756 in transformed A. niger N593 strains 



pGW1756 (Example 3.10) is used to transform A. niger N593, using pGW635 as the selective vector 
see (Example 5.2). Eight transformants picked at random are purified and are used for further analysis. 
They are grown on a liquid minimal salt medium to which ammonium chloride (4.0 g/l). 1 % pectin (d.e. 
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61.2) and 1 % dried and miifed sugar beet pulp is added, using the conditions as described in Exannple 5.5. 
PGII overproduction into the culture filtrates of the transformants is shown by Western blotting, using PGII 
specific antibodies. Two of the PGII over-producing transformants (N593/pGWl 756-6 and N593/pGWl756- 
7) are selected and these strains and the control strain N402 (see Example 5.5) are grown again using the 
5 conditions described above. Culture filtrates are obtained at 44 h after inoculation and the polygalacturonase 
activity in the crude filtrates is determined as described (ref. 2). whereas the enzyme reaction mixture is 
buffered at pH 4.8 using 50 mM sodium acetate. 

The results also show that the pga II gene on pGW1756 is functional and that this plasmid can be used 
to transform A. niger to overexpress polygalacturonase activity. 



Example 12: Polygalacturonasec Production by transformed A. nidulans strains 



Plasmid pGW1910 carrying the pga C gene of A. niger N400 (Example 7,3) is used as the cotransfor- 
75 ming plasmid in an experiment designed to transform A. nidulans G191 to uridine prototrophy. as described 
in Example 9.2. Ten of the resulting transformants are purified and they are subsequently used to propagate 
spores for further analysis. These strains and Gl91/pGW635-1 are grown in a liquid minimal salts medium 
supplemented with 2 mg/ml p-aminobenzoate, 4.0 g/l NH4CI as the nitrogen source. 1 % sugar beet pulp 
and 1 % pectin as the carbon sources, with phosphate added at a high concentration (15 g/l KH2PO4). 
20 Inoculation is at 10^ spores/ml and incubation is at 30* C in a Gallenkamp orbital shaker at 250 rpm. Culture 
filtrates are obtained at 65 h after inoculation and the filtrates are subsequently analyzed, without prior 
purification or concentration. Western blots are incubated with a mixture of the antibodies raised against PGI 
and PGII (Example 1.2). Activity measurements (ref. 2) are performed in a 50 mM sodium acetate buffer pH 
4.8 containing 0.25 % polygalacturonic acid (USB). 
25 On the basis of the Western blotting most of the transformants obtained produce high amounts of 
polygalacturonasec (PGC). This result is confirmed by the measuring of PG activity in the supernatant of 
the transformants and of a control strain (A. nidulans G191 transformed with pGW635). 



30 Example 13: The expression of the A. niger pgaA and pgaD genes in transformed A. nidulans strains 



In addition to the pga I, pga II and pga C genes other members of the polygalacturonase gene family of 
™ ^'9®'' (Example 7.2) are used to transform A. nidulans . In the present Example clones In phage X are 
used, instead of subclones thereof in a plasmid vector. 

35 Phage X DNA is prepared from plate lysates as described in Example 3.5 and is then extracted once 

more with phenol and chloroform. For a cotransformation experiment approx. 20 ug of \ DNA is used, and 
the DNA is in about equal amounts obtained from 2 phages of the same class (e.g. XPG-A10 in combination 
with XPG-A43 or XPG-D8 in combination with XPG-D11). The cotransformation of A. nidulans G191 is 
performed as described in Example 9.2, whereas the controls now include cotransformations with pGW1900 

40 and XPGI-7. separatedly. Using X phage DNA prepared as described as above, the transformation 
frequencies are lower as compared with plasmid DNA purified by banding in cesium chloride gradients. 

The relevant transformants are purified and analyzed as described for the pGW1910 transformants in 
Example 12. In this case the transformants are also grown on a medium containing 3 % glucose as the 
carbon source (high phosphate, 0.1 % yeast extract. 1 % ammonium chloride). 

45 Transformants A. nidulans Gl91/lambdaA-1 and G191/lambdaD-1 have been obtained using the class 
PG-A and PG-D phages, respectively, mentioned above. The polygalacturonase activities in the culture 
filtrates of the beet pulp/pectin medium at 65 h after inoculation were significantly higher than in the control 
strain Gl91/pGW635-1. The pga il related sequences present in the class A and D phages thus encode 
proteins with polygalacturonase activity. On the basis of the Western blots it is concluded that these gene 

50 products, polygalacturonaseA and polygalacturonaseD, respectively, migrate with a similar electrophoretic 
mobility as PGI. 

A. nidulans Gl91/lambdaA-1 produces polygalacturonaseA also on the medium with glucose as the 
carbon source. 

55 

Example 14: The expression of hybrid interferon in transformed A. nidulans strains 



For a cotransformation experiment approx. 20 ug of plasmid DNA pGII-lFN AM1 19 or pGllss-IFN 
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AM119 is used. The cotransformation of A. nidulans G191 is performed as described in Example 9.2. 

The relevant transformants are purified and analyzed as described for the transformants in Example 8.4. 
in this case the transformants are also grown on a medium containing 3% glucose as the carbon source 
(high phosphate, 0.1% yeast extract, 1% ammonium chloride). 

5 At various times (every 20 hours) samples are taken, the cells are pelleted by centrifugation and broken 
by freezedrying and dry grindling. Supernatant and cell extracts ?re both tested for interferon activity as 
described (supra). The bulk of the interferon activity is found secreted into the medium in transformants 
carrying pGIIss-IFN AM119 while in transformants carrying pGII-IFN AM119 it is mainly in the cell extract 
which indicates that A. nidulans produces (FN from pGII-IFN AM119 or pGllss-IFN AM119 when glucose is 

10 used as sole carbon source. 



Example 15: Maceration properties of polygalacturonase II and polygalacturonase I. 



75 Cell separation of cucumber tissue is used to assay the macerating activity of A. niger polygalac- 
turonases. A cucumber fruit is purchased in a local grocery and the surface is subsequently decontami- 
nated by wiping with a tissue soaked in ethanol or by floating for one hour in Javel water (grocery grade, 
diluted to 0.5% sodium hypochlorite). The cucumber is then sliced and the soft inner part of. the slices is 
removed. Two of the resulting slices (approx. 12 g) are placed in a 100 ml Erlenmeyer flask containing 25 

20 ml of a maceration buffer to which polygalacturonase had been previously added, or not. The subsequent 
incubation is for 24 hours with gentle unidirectional shaking in a water bath thermostated at 30 'C. The 
flasks are then visually inspected. Complete maceration meets the following criteria: (i) the peel of the 
cucumber fruit is essentially free of adhering tissue, as indicated by macroscopic examination; ii) the 
maceration buffer has become highly turbid; iii) there is a positive correlation between cell separation and 

25 the high turbidity of the macerating buffer, as indicated by a high proportion of loose celts which appear to 
be intact as judged by microscopic visualization. 

Polygalacturonase II and polygalacturonase I prepared as described in Example 1 are tested in different 
buffers. Maceration buffer A (MBA) is 50 mM sodium acetate pH 4.8. Maceration buffer B (MBB) is a 
phosphate-citrate buffer, (Ishii (1976), Phytopathology 66 . pp.281 -289). MBB is prepared by mixing 0.1 M 

30 citric acid and 0.1 M Na2HP04 solutions until the desTred pH value. 4.5, is reached, and then an equal 
volume of distilfed water is added and subsequently bovine serum albumin is added to a final concentration 
of 10 mg/25 mi, using a concentrated solution of 10 mg/ml. 

Cucumber tissue is completely macerated within 24 hrs, using 1 ug.of polygalacturonase tl in MBA. In 
this buffer the cucumber tissue is not macerated in the absence of polygalacturonase or in the presence of 

35 1 ug polygalacturonase I. Maceration of cucumber tissue in MBB is tested in the absence of polygalac- 
turonase or in the presence of 10 ug polygalacturonase I or polygalacturonase II. respectively. Again, 
maceration is not observed in the absence of enzyme or in the presence of polygalacturonase I, whereas 
polygalacturonase II completely macerates the cucumber tissue. PGM prepared according to Example 10 
has also macerating properties, 

40 
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Deposited microorganisms: 

Following microorganisms and phages are deposited with the Deutsche Sammlung von Mikroorganis- 
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men und Zellkulturen (DSM), Mascheroder Weg 16, D-3300 Braunschweig: 





Microorganisms: 


DSM-No,: 


Date of 


5 






Deposition: 




Escherichia coil JM109/pGW1800 


5505 . 


August 30, 1989 




Escherichia coli DH5aFVpGW1900 


5866 


March 21. 1990 
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XPG-D12 


5953 






XPG-E6 


5946 






XPG-E38 


5963 
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XPG-F5 


5950 






XPG-G17 


5955 






XPG-G27 


5957 






XPG-X31 


5958 






XPG-Y33 


5959 
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Sequence Listing 



SEQ ID NO. 1 
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SEQUENCE TYPE; Nucleotide with corresponding polypeptide 
SEQUENCE LENGTH: 2495 base pairs 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: genomic 

ORIGINAL SOURCE ORGANISM: Aspergillus niger N400 

IMMEDIATE EXPERIMENTAL SOURCE: Plasmid pGW1900 (DSM 5866) 

FEATURES: Gene pgal encoding A. niger N400 prepro-polygalacturonase I 
from 1 to 909: promoter region 

from 901 to 963: coding region for PGI signal peptide 
from 901 to 1002: coding region for PGI prepro-sequence 
from 1003 to 2127: coding region for mature PGI 
from 1138 to 1189: intron 
from 1610 to 1671: intron 
from 2138 to 2130: stop codon 
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from 2131 to 2495: 3 non coding region, part of. transcriptional terminator region 



GAATTCGGAA GAATGTCTGC GTTCGTGCGC ACAACGACGT 4 0 

TCCTAAAATT TATCCGATGA TCAACCGAGG AACCAGGGTC 8 0 

GAACCCCTGA AAGGAATCTC GCCGAAAGGA TCAATCATAT 12 0 

TGATTCGCGG ACCCTGTAAA GTGCCCTAAA GGGTCTTTGC 160 

CACTCGATAG TGGGATCGGC ACTGGCAGTT TCTAGTCTCG 2 00 

TCAGTGCGGC CATTTCCGAC CGATCCGTAG TGCTGGTGGG 240 

TGAGTCCCAG AGCAGTCTAT ATCGATATCA TCACCAAATT 280 

CCAAGGACCG TGGGATGACG AGTATGCATT GGCACTGAGA 32 0 

ATTGCCGAGA TCATGCCCCC GTCCTGGAGG TGCATCCATC 3 60 

AAGCATGTTG CGACAGAGTG AAACAGCGTG GATAAGCCGG 4 00 

ATGGGGCAGA TGGGGGAAGA GAGCGAGACA CGGCATCAGC 440 

AGTGGAGATG CCGAGAAGTG CCGAGAGCCG CCGATGCTTC 480 
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70 



75 
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25 



30 
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GACCTTCAGC CCAAGCGTCA ATCCATGCCT TCTTGGGTCA 520 

GGGTTGGCCG GACTGTAAGC CCGTTAGCGT CTGAGATACG 5 60 

CCGGATGACA CGAACCTTGG TGCTTACCAA TGCTGTGATG 600 

CATGCAGTGC CGGCGGTATC CCTGGCTGAA GCGCCCATCG 64 0 

CCGTTAGAGA TTGATACCCG GGTGGATCGG AGGGTCCCCT 68 0 

TGGTCTTTCC ACCAATAATG GAGGTATCTC ACTGCTTTGT 72 0 

TCAGGAACAG. AAGCTTAGCA TGGACCAGTC TTTCACGTAT 7 60 

AAAACTCTCC AGGACTTCCG TCGTTGAGAT GCTCTATCCA 800 

ACAACACCTC ACTCTTCCAC TCCTTGTTCT TTCTTTCTGT 84 0 

TGACCAGACC CCATCCATTC TTTTGGCCGA AACCTGTTCT 8 80 

GTTGACCAGA ACCTATACCA ACAATCACC ATG CAC TCT 918 

Met His Ser 
1 

TAG GAG CTT CTT GGC GTG GCC GCT GTC GGC TCC CTC 95 4 

Tyr Gin Leu Leu Giy Leu Aia Ala Vai Giy Ser Leu 

5 10 15 

GTC TCT GCC GCC GCC GCT CCT TCT GGC GTC TCC GAG 9 90 

Val Ser Aia Ala Pro Ala Pro Ser Arg Val Ser Glu 

20 25 

TTC GCT i^JlG AAG GCC TCT ACC TGC ACC TTC ACC TCT 102 6 
Phe Ala Lys Lys Ala Ser Thr Cys Thr ?he Thr Ser 
30 35 

GCC TCT GAG GCC AGC GAG AGC ATC TCC AGC TGC TCC 1022 
Ala Ser Glu Ala Ser Glu Ser lie Ser Ser Cys Ser 
40 45 50 



GAT GTT GTC CTG AGC AGC ATC GAG GTC GCC GCT GGC 10 98 
Asp Val Val Leu Ser Ser lie Glu Val Pro Ala Giy 
45 55 60 
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GAG ACC CTC GAC CTG TCC GAT GCT GCT GAT GGC TCC 113 4 
Glu Thr Leu Asp Leu Ser Asp Ala Ala Asp Gly Ser 
65 70 75 

ACC GTATGTGCTC TCAGCCGTCT TCCATCCTGG CTATCACGCT 117 7 
Thr 



AACACCATCT AG ATC ACC TTC GAG GGC ACC ACT TCC TTC 1216 
He Thr Phe Glu Gly Thr Thr Ser Phe 
75 80 85 



GGA TAC AAG GAA TGG AAG GGC CCC CTG ATC CGC TTC 12 52 
Gly Tyr Lys Glu Trp Lys Giy Pro Leu He Arg Phe 

'90 95 



GGT GGT AAG GAT CTG ACT GTC ACC ATG GCC GAC GGC 128 8 
Gly Giy Lys Asp Leu Thr Val Thr Met Ala Asp Giy 
100 105 



GCT GTC ATC GAC GGT GAC GGT TCC CGC TGG TGG GAC 132 4 
30 Ala Vai He Asp Gly Asp Gly Ser Arg Trp Trp Asp 
110 115 120 



AGC AAG GGT ACC AAC GGT GGC AAG ACC AAG CCC AAG 13 60 

Ser Lys Gly Thr Asn Gly Giy Lys Thr Lys Pro Lys 

125 130 

TTC ATG TAC ATC CAC GAT GTT GAG GAC TCG ACC TTC 13 9 6 

Phe Met Tyr lie His Asp Val Glu Asp Ser Thr Phe 

135 140 145 

AAG GGC ATC AAC ATC AAG AAC ACT CCC GTC CAG GCC 14 32 

Lys Gly lie Asn lie Lys Asn Thr Pro Val Gin Ala 

150 155 
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ATC AGT GTC CAG GCT ACC AAC GTC CAC CTG AAC GAG 14 68 
lie Ssr Val Gin Ala Thr Asn Val His Leu Asn Asp 
160 165 

TTC ACC ATC GAC AAC TCC GAC GGT GAT GAC AAC GGT 1504 
Phe Thr lie Asp Asn Ser Asp Giy Asp Asp Asn Gly 
170 175 180 

GGC CAC AAC ACC GAC GGT TTC GAC ATC 'AGC GAG TCT 1540 
Gly His Asn Thr Asp Gly Phe Asp lie Ser Glu Ser 
185 190 

ACC GGT GTC TAG ATC AGC GGT GCT ACC GTC AAG AAC 157 6 
Thr Gly Val Tyr lie Ser Gly Ala Thr Val Lys Asn 
195 200 205 

2^ CAG GAC GAC TGC ATT GCC ATC AAC TCT GGC GAG 160 9 

Gin Asp Asp Cys lie Ala lie Asn Ser Gly Glu 

210 215 

30 

GCACGATATC CCTATTCCAC TATCATTCCT TCCATTCATA 164 9 

TCGCTAAC.AA TCAAACCCAC AG AGC ATC TCT TTC ACC 168 6 

35 Ser lie Ser Phe Thr 

220 

GGC GGT ACC TGC TCC GGT GGC CAC GGT CTC TCC ATC 1722 

40 

Gly Gly Thr Cys Ser Gly Gly His Gly Leu Ser lie 
225 230* 

^5 GGC TCT GTC GGT GGC CGT GAT GAC AAC ACC GTC AAG 1758 

Gly Ser Val Gly Gly Arg Asp Asp Asn Thr Val Lys 
235 240 245 

50 
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AAC GTG ACC ATC TCC GAG TCC ACT GTC AGC AAC TCC 17 94 

Asn Val Thr lie Ser Asp Ser Thr Val Ser Asn Ser 

250 255 ' 

GCC AAC GGT GTC CGC ATC AAG ACC ATC TAG AAG GAG 18 30 

Ala Asn Gly Val Arg lie Lys Thr lie Tyr Lys Glu 
260 265 

ACC GGT GAT GTC AGC GAG ATC ACC TAG TCT AAC ATC 18 66 

Thr Gly Asp Val Ser Glu lie Thr Tyr Ser Asn He 
270 275 280 

CAG CTC TCC GGA ATC ACC GAC TAG GGT ATC GTC ATC 1902 

Gin Leu Ser Gly lie Thr Asp Tyr Gly He *Val He 
285 290 

GAG CAG GAC TAG GAG AAC GGC TCT CCC ACC GGC ACC 19 38 

Glu Gin Asp Tyr Glu Asn Gly Ser Pro Thr Gly Thr 

295 300 305 

CCC TCC ACC GGT ATC CCC ATC ACT GAT GTC ACC GTT 197 4 

Pro Ser Thr Gly lie Pro lie Thr Asp Val Thr Val 

310 315 

GAC GGT GTC ACC GGT ACT CTC GAG GAT GAC GCC ACC 2010 

Asp Gly Val Thr Gly Thr Leu Glu Asp Asp Ala Thr 
320 325 

CAG GTC TAG ATT CTC TGC GGT GAC GGC TCT TGC TCT 204 6 

Gin Val Tyr He Leu Cys Gly Asp Gly Ser Cys Ser 
330 335 340 

GAC TGG ACCTGG TCC GGT GTT GAC CTC TCT GGT GGC 2 0 82 

Asp Trp Thr Trp Ser Gly Val Asp Leu Ser Gly Gly 
345 350 
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AAG ACC AGC GAT AAA TGC GAG AAC GTT OCT TCC GGT 2118 
Lys Thr Ser Asp Lys Cys Giu Asn Val Pro Ser Gly 
355 360 365 

GOT TCT TGC TAA ATCGTTCCTC CGGATGCGAG GCAACGTCTG 2160 
Ala Ser Cys 
368 

TAGGACGTCT GGTTGTATAT ATCGATCCAC ACTTGACATG 220 0 

TACTAGTTAG GTGGTTTTAC TGCCATAGTG TTAGTGAATA 22 4 0 

ATAGGAGCTT TGCTCAATAT GTGAATTTGT AGCAGAAGTA 22 8 0 

AATGAGTAGA CTGATAGAGG TAATGATGAA AGATAGAATT 2 32 0 

TAATACCAGG TGATGAAGTT AATAACGGGG .TGAATTAGGG 2350 

CGCGTAGGCT TAGGTATATA AGTTGTCATC CCTCACACTC 24 0 0 

GAAATCTCTT CCTCTTTCTT ATATCTTCCA ACCTCTGAAC 2440 

CACCAGTTGC CTCCACAGAC TAACAAGATT CTTCTATATC 24 80 

GATGCTTGAT CTCAA 2 4 95 



30 SEQ ID NO. 2 

SEQUENCE TYPE: Nucleotide with corresponding polypeptide 
SEQUENCE LENGTH: 3031 base pairs 
STRANDEDNESS: double 
35 TOPOLOGY: linear 

MOLECULE TYPE: genomic 

ORIGINAL SOURCE ORGANISM: Aspergillus niger N400 

IMMEDIATE EXPERIMENTAL SOURCE: Plasmid pGW1800 (DSM 5505) 

FEATURES: Gene pgall encoding A. niger N400 prepro-polygalacturonase II 
40 from 1 to 1 356; promoter region 

from 1357 to 1419: coding region for PGI! signal peptide 

from 1357 to 1437: coding region for PGIi prepro-sequence 

from 1438 to 2494: coding region for mature PGII 

from 1987 to 2038: intron 
45 from 2495 to 2497: stop codon 

from 2498 to 3031: 3 non-coding region, part of transcriptional terminator region 
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TGGAGAACAT 


AACACGCTGG 


4 U 


5 


i i AAi 1 CjC i 


TCjACCjAACGC 


GTTGTCTCCA 


GGTCAGAGCA 


8 0 






i L-AA i A i ACjU 


J. (jGuTTCTAC 


AACCTTGAAA 


U 






TPr* ArTT* a 7\ T 

i LsOA^jULaAA i 


CUCUC i AoAA 


C ACjjCjCj 1 AAl G 


1 bU 






i 1 Lao i J. L. AC 


C HoUCAiCjAi 


GGAAi AG IG I 


o n n 
^ U U 


70 


(jCLiL. AC i LaVj i 


oCjCTTCCvjAT 


G C AC GAG AC T 


CAGACATAAA 


2 4 0 






CCAGCACTAC 


TTTGTCCTCC 


TGCCCCGAGG 


280 




AA i Cj AA i A 


ATAGGAGGAA 


TGTCGATATT 


CGCGCTCGAT 


n O 

320 


75 


AGCATCACTC 


GGATATCGAG 


AATACCGGGC 


TGAGGTTGTT 


360 




GCAGGTGGAA 


AGATTTGCTC 


AATCTGTATC 


GAAAAAACGT 


400 




AACTTGATGA 


ATGACCTTTC 


AACAACTAAC 


CGCCCCTATC 


440 




CACTGCTGTG 


ACCAGTATAG 


TCCACCGGAT 


TCGCTCACCG 


480 


20 


TTAGCCAGTG 


GCTTCCTCCT 


TTTTCGTCAT 


CTTTTCCTCT 


520 




TCTTGACTGG 


CCCCACTCCA 


GGTTGCTTGT 


TGCCGGCAAG 


560 
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CTTTCGACCC TGATTAGCGA GCTTTGCGCA CTATTCCTGA 600 

TTTAGCTGTC GAAGGAACGG ATGGACGCTG CGCTAGTTTC 64 0 

5 ATCTCGAATC TCTGATATTA ACATTGGCTA CAGACTAGCC 680 

GATTACGCTC CGGAATCTGG CACACAGGGA 'GTTTATGGAC 72 0 

CGTTCATTGG TGGAACTAGC AAAGCACCAT AATTGCCGCG 7 60 

GACCCTGCAT TTCAGGCTGC AGCCTTACTA GGATTAGTAT 8 00 

'° AGCTGTCTTC TCGTACTCGT ACACTGCAGT ACCGGCCTGG 8 40 

AATCTCGGGA TCAACGATGG CATCCGCTTC CTCCAGGACT 8 80 

TACGGCTGCT TGTGAAGGCC AATTATAGTG TTGCTTGTTT 920 

j5 ACGTGCCAAT TCAGCGGTAC ATACAAACGC TTCTCTATCT 9 60 

TGCCCTTTTT GACAATAGGT TTACCCTCGG GAGCGGAGCT 10 00 

TTGCCTTCTT TCCACCGACA TGCGCATCCG TTCCATCACC 10 4 0 

CGCGGAACCC GTCGGCTGAT CAGCCACGCA CGGCTGGTAT 1080 

AAATTAATCG GCCACTCATG TCGAACTGAG GTTCACGGGA 1120 

AAACGCAATA TTTGAGACAA CACCTCAATA TGAAACGAGG 1150 

ATCCAGGGTC CTACATTTCC TCCAGGGGCT GTCGGCAGTT 12 0 0 

,5 ATGAACTTTT CGACCGGAA-A AGATTCGCAA TAGTCGTGAG 12 4 0 

TATAAGAACC TCGTACCTGC TCACACTGAT GTCTACTTGC 12 8 0 

TCATCATTCC ACACTCATTC AAAATCTTAC CAACAACACT 13 20 

CCTTCTGTCA TTCTTTTCTA TTGTTAACAA TTAATC ATG CAC 13 62 

Met His 
1 

J. TCG TTT GOT TCT CTT CTC GCC TAG GGC CTG GTC GCC 13 93 
Ser Phe Ala Ser Leu Leu Ala Tyr Gly Leu Val Ala 
5 0 

■° GGC GCC ACC TTC GCT TCT GCC TCT CCT ATC GAA GCT 14 34 
Gly Ala Thr Phe Ala Ser Ala Ser Pro' lie Glu Ala 
15 20 25 

'5 

CGA GAC AGC TGC ACG TTC ACC ACC GCT GCC GCT GCT 1470 



Gly Ala Thr Phe Ala Ser Ala Ser Pro' lie Glu Ala 
15 20 25 

45 

CGA GAC AGC TGC ACG TTC ACC ACC GCT GCC GCT GCT 
Arg Asp Ser Cys Thr Phe Thr Thr Ala Ala Ala Ala 
30 35 

50 
55 
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AAA GCG GGC AAG GCG AAA TGC TCT ACT ATC ACC CTT 150 6 
Lys Ala Gly Lys Ala Lys Cys Ser Thr lie Thr Leu 
40 45 . 50 

AAC AAC ATC GAA GTT CCA GCT GGA ACC ACC CTC GAC 1542 
Asn Asn lie Glu Val Pro Ala Gly Thr Thr Leu Asp 

55 60 

CTG ACC GGT CTC ACC AGC GGT ACC AAG GTC ATC TTC 1578 
Leu Thr Gly Leu Thr Ser Gly Thr Lys Val lie Phe 
65 70 

GAG GGC. ACC ACG ACC TTC CAG TAG GAA GAA TGG GCA" " 1614 
Glu Gly Thr Thr Thr Phe Gin Tyr Glu Glu Trp Ala 
^5 80 85 



GGC CCC TTG ATC TCC ATG ACT GGC GAA CAT ATC ACC 165 0 
Gly Pro Leu lie Ser Met Ser Gly Glu His lie Thr 
30 90 95 



GTC ACT GGT GCC TCC GGC CAC CTC ATC AAT TGC GAT 168 5 
Val Thr Gly Ala Ser Gly His Leu lie Asn Cys Asp 
100 105 110 



GGT GCG CGC TGG TGG GAT GGC AAG GGA ACC AGC GGA 1722 
Gly Ala Arg Trp Trp Asp Gly Lys Gly Thr Ser Gly 

115 • 120 



45 AAG AAG AAG CCC AAG TTC TTT TAC GCC CAT GGC CTT 1758 
Lys Lys Lys Pro Lys Phe Phe Tyr Ala His Gly Leu 
125 130 



GAC TCC TCG TCT ATT ACT GGA TTA AAC ATC AAA AAC 17 94 
Asp Ser Ser Ser lie Thr Gly Leu Asn lie Lys Asn 
135 140 145 
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ACC CCC CTT ATG GCG TTT ACT GTC GAG GCG AAT GAG 18 30 
Thr Pro Leu Met Ala Phe Ser Val Gin Ala Asn Asp 
150 155 

ATT ACG TTT ACC GAT GTT ACC ATC AAT AAT GCG GAT 18 66 
lie Thr Phe Thr Asp Vai Thr lie Asn Asn Ala Asp 
160 165 170 

GGC GAC ACC CAG GGT GGA CAC AAC ACT GAT GCG TTC 1902 
Gly Asp Thr Gin Gly Gly His Asn Thr Asp Ala Phe 

175 180 

GAT GTT GGC AAC TCG GTC GGG GTG AAT ATC ATT AAG 193 8 
Asp Vai Gly Asn Ser Vai Gly Val Asn lie lie Lys 
185 190 

CCT TGG GTC CAT AAC CAG GAT GAC TGT CTT GCG GTT 197 4 
Pro Trp Vai His Asn Gin Asp Asp Cys Leu Ala Val 
195 200 205 

AAC TCT GGC GAG GTAAGCAGCT CTGCATATAT GCTTGATTCG 2016 
Asn Ser Gly Giu 
210 

TAATTATATT GATATTCTAT AG AAC ATC TGG TTC ACC GGC 2 0 56 

Asn lie Trp Phe Thr Gly 

215 

GGC ACC TGC ATT GGC GGC CAC GGT CTC TCC ATC GGC 2 0 92 
Gly Thr Cys lie Gly Gly His Gly Leu Ser lie Gly 
220 225 

TCT GTC GGC GAC CGC TCC AAC AAC GTC GTC AAG AAC 212 8 
Ser Val Gly Asp Arg Ser Asn Asn Val Vai Lys Asn 
230 235 240 
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GTC ACC ATC GAA CAC TCC ACC GTG AGC AAT TCC GAA 2164 

Val Thr lie Glu His Ser Thr Val Ser Asn Ser Glu 

245 250 

AAC GCC GTC CGA ATT ' AAG ACC ATC TCT GGC GCC ACT 2200 

Asn Ala Val Arg lie Lys Thr lie Ser Gly Ala Thr 
255 260 

GGC TCC GTG TCC GAG ATT ACG TAC TCC AAC ATC GTC 2236 

Gly Ser Val Ser Glu lie Thr Tyr Ser Asn lie Val 

255 270 275 

ATG TCT GGC ATC TCC GAT TAC GGC GTG GTC ATT CAG 2272 

Met Ser Gly' He Ser Asp Tyr Gly Val Val He Gin 
280 285 

CAG GAT TAC GAA GAC GGC AAG CCT ACG GGT AAG CCG 230 8 

Gin Asp Tyr Glu Asp Gly Lys Pro Thr Gly Lys Pro 
290 295 300 

.--.CG AAC GGT GTC ACT ATT CAG GAT GTT AAG CTG GAG 234 4 

Thr Asn Gly Val Thr He Gin Asp Val Lys Leu Glu 

305 310 

AGC GTG ACT GGT AGC GTG GAT AGT GGG GCT ACT GAG 2 3 80 

Ser Val Thr Gly Ser Val Asp Ser Gly Ala Thr Glu 
315 320 

ATC TAT CTT CTT TGC GGG TCT' GGT AGC TGC TCG GAC 2416 

He Tyr Leu Leu Cys Gly Ser Gly Ser Cys Ser Asp 

325 330 335 

TGG ACC TGG GAC GAT GTG AAA GTT ACC GGG GGG AAG 2 452 
Trp Thr Trp Asp Asp Val. Lys Val Thr Gly Gly Lys 
340 345 
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AAG TCC ACC GCT TGC AAG AAC TTC CCT TCG GTG GCC 248 8 

Lys Ser Thr Ala Cys Lys Asn Phe Pro Ser Val Ala 
^ 350 355 360 

TCT TGT TAG GCTGCTAGGT TGGTGAGTTG TAGCCCTAGC 252 7 

70 Ser Cys 
362 



75 



TGAAATTCGT CTGCTTCGTC TGCTTCGTCT GCTTCGTCTG 25 67 

CTTCGTCTGC TTCTTCTGCT TCGTCTGCTT TGTCTGCTTT 2 607 

GTCTGCTTCG TCCACTTCGT CCACTTCGAC TGGTTAGATG 2 647 

GGCCTTGTAA TAGTTTTTAG AGAGAACAGA ATATGTACAG 2 687 

20 TAAGCCTTAG AGGTGGTACC GAGTTGTATA TTTATTTAAA 2727 

ATGTTACCTA TCGCGTGTCT TTATATTTAT AGCCTTTTAC 2 7 67 

ATATATACGG AGCTACAGTG GATTATCTTA CAGCCCACAC 2 80 7 

TCATCGTGCT GGGAACTACG TGAJVTGAATG CTCGGTTAGA 284 7 

AGGCCTTGCT CACTGCCACA ACCAACCAGG AACCTTGGCA 2 88 7 

GGTACATGCT TGGGCATTTT TGTCTGGCCC TATCTCTTTC 2 927 

CAGATGGTGG TCTGGATGAG TCACGGCACG AGTAGATTGA 2 9 67 

CCGCTACTCC AACCCGCGCA TAAAGCATAC GCCAGAAGTG 30 0 7 

CAAGGGATAC AA.GACAGCCA GCTG 3031 
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SEQ ID MO. 3 

SEQUENCE TYPE: Nucleotide with corresponding polypeptide 
SEQUENCE LENGTH: 3047 base pairs 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: genomic 

ORIGINAL SOURCE ORGANISM: Aspergillus niger NW756 
IMMEDIATE EXPERIMENTAL SOURCE: pGW1755 (DSM 5942) 

FEATURES: Gene pgall encoding A. niger NW756 prepro-polygalacturonase It 
from 1 to 889: promoter region 

from 890 to 952: coding region for PGII signal peptide 

from 890 to 970: putative coding region for PGII prepro-sequence 

from 971 to 2027: putative coding region for mature PGII 

from 1520 to 1571: intron 

from 2028 to 2030: stop codon 

from 2031 to 3047: 3 non coding region, part of transcriptional terminator region 
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iCACCGTTGG 


CCAATGGCTT 


CCTCCTTTCC 


40 




TGTCTTCTCC 


TTCCTCTCCT 


TGACTGGCCC 


CACTCCAGGT 


80 


5 


i CjU I i Ai 1 CjC 


CGGCAAGCTT 


TCAACCCTGA 


TTAGCGAGCT 


120 




1 HjL-Ak^AL. i. A 


Ti fT* ^ ^ ^ m rn 

TTCCCGATGT 


AGCTGTCGAA 


GGAACGGATG 


150 






i Ab i i i OATT 


TCGAATCTCT 


GAT AT T AAT A 


O A A 

200 


10 




AG i GACTAGC 


CGATCACGCT 


CCGGATCTGG 


240 




CAC ACjACj i (jA 


TTTTTTGAAT 


CATTCATTGG 


TGGAATTTGC 


O A 

280 




AAvjCjL^AUv^A i 


AAATGCCGCA 


GACCCTGCAT 


ATCGAGCTAC 


A A 

320 




A.VaCUl i AC i Cj 


GGATTAGTAT 


AGCTGCCATC 


TCGTACTCGT 


360 


15 


AL.AC rCjCACjA 


ACCGGCTTGG 


ATCCCAAGGC 


CTACGATGAC 


400 




AuCuCj i i i 


TCCAGGACTG 


ATGGCTGTTT 


GTGAAGGCGA 


440 




ATTATAGTAC 


TGCTTGTCAA 


TGTCTTGAGT 


CAGCGGTATA 


480 


20 


TATGAATCTG 


TCTTTGTCTT 


TCCACCTTGA 


CAATAATGTT 


520 




ACGCTCAAGA 


GGGCATTTTG 


CGTTCTTTTC 


ATCGACATGC 


560 




GAACCTGTTC 


GGTCACCCGC 


GGACCCCGTC 


GGCTGATCAG 


600 
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CCACCACGGC TCATATATAT TAGTTGCCCA CCATGTCGAA 64 0 

CTTAAGTTCA TTAAAAAAAA GGTAACATTT GAGACAATAT 68 0 

CTTAATGTGA AACGTGAACC CTGGACTAGC ATCTCTCCAG 7 20 

AGGCTGTCGG CAGTTATGAC TTTCCGATCA *GAAGAGATGC 7 60 

GCTGAAATTG TGACTATAAG AACCTCAAGC CTGCCGATGC 8 00 

TGAGGTGAGT TTGCTCATCA TCCTACACTC ATTTGGCATC 8 40 

AGACCGATTA CACTCTTTTT GTCCTTTTTT TCTATCGCTA 88 0 

TCATTGACC ATG CAC TCC TTT GCT TCT CTT CTG GCC 916 
Met Kis Ser Phe Ala Ser Leu Leu Ala 
1 5 

TAG GGC CTA GCC GCC AGC GCC ACC CTC GCT TCT GCC 952 
Tyr Gly Leu Ala Ala Ser Ala Thr Leu Ala Ser Ala 
10 15 20 

TCC CCC ATC GAA GCC CGG GGA AGC TGC ACC TTC AAA 98 8 
Ser Pro lie Glu Ala Arg Gly Ser Cys Thr Phe Lys 
25 30 

ACC GCT GCT GCT GCC AAA GCG GGC AAG GCA GGG TGC 102 4 
Thr Ala Ala Ala Ala Lys Ala Gly Lys Ala Gly Cys 
35 40 45 

TCT ACC ATC ACC CTT GAC AAC ATC GAA GTC CCC GCT 10 60 
Ser Thr lie Thr Leu Asp Asn lie Glu Val Pro Ala 

50 55 

GGA ACC ACC CTC GAC CTG ACC GGT CTC ACC AGC GGT 10 96 

Gly Thr Thr Leu Asp Leu Thr Gly Leu Thr Ser Gly 
60 65 

ACG AAG GTC ATC TTC GAG GGC ACC ACG ACC TTC GAT 1132 
Thr Lys Val He Phe Glu Gly Thr Thr Thr Phe Asp 
70 75 80 
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TAT GAA GAA TGG GCA GGC CCC TTG ATC TCC ATG ACT 1168 

Tyr Glu Giu Trp Ala Gly Pro Leu lie Ser Met Ser 
85 90 ' 

GGC AAA GAT ATC ACC GTC ACT GGT GCC TCA GGC CAT 1204 

Gly Lys Asp He Thr Val Thr Gly Ala Ser Gly His 

95 100 105 

CTC ATC AAC TGC GAC GGT GCG CGG TGG TGG GAC GGC 124 0 

Leu lie Asn Cys Asp Gly Ala Arg Trp Trp Asp Gly 

110 115 

AAG GGG ACC AGC GGA AAG AAG AAG CCC AAG TTC TTC 1275 

Lys Gly Thr Ser Gly Lys Lys Lys Pro Lys Phe Phe 

120 125 



TAG GCT CAT GGC CTT GAC TCC TCG TCC ATT ACT GGA 1312 
Tyr Ala His Gly Leu Asp Ser Ser Ser He Thr Gly 
130 135 140 

TTG AAT ATC AAG AAC ACT CCC CTT ATG GCG TTT AGT 1348 
35 Leu Asn lie Lys Asn Thr Pro Leu Met A.la Phe Ser 

145 150 

GTT CAG GCG GAT GAC ATC ACT CTG ACT GAC ATT ACC 1384 

40 

Val Gin Ala Asp Asp He Thr Leu Thr Asp He Thr 
155 150 165 

ATC AAC AAC GCG GAC GGT GAT ACC CTG GGT GGA CAC 1420 
He Asn Asn Ala Asp Gly Asp Thr Leu Gly Gly His 

170 175 



AAC ACT GAT -GCG TTT GAT GTT GGT AAC TCT GTC GGT 145 6 
Asn Thr Asp Ala Phe Asp Val Gly Asn Ser Val Gly 
180 185 



55 



74 

BNSDOCID;<EP 0421919A2> 



EP 0 421 919 A2 



70 



15 



20 



25 



30 



35 



45 



50 



GTG AAT ATC ATC AAA CCG TGG GTC CAT AAC CAG GAT 14 92 
Vai Asn lie lie Lys Pro Trp Val His Asn Gin Asp 
190 195 ' 200 

GAG TGT CTT GCG ATC AAC TCT GGC GAG GTAAGCAGCT 152 9 
Asp Cys Leu Ala lie Asn Ser Gly Glu 
205 210 

TTGAACATAG ATTTGATTTG CATGTATGTT GATATTCTAT AG 1571 



AAC ATC TGG TTT ACC AGC GGC ACC TGC ATT GGC GGC 1607 

Asn lie Trp Phe Thr Ser Gly Thr Cys lie Gly Gly 

215 220 

CAC GGT CTC TCC ATC GGT TCT GTC GGC GGC CGC TCC 1643 

His Gly Leu Ser lie Gly Ser Val Gly Gly Arg Ser 
225 230 

AAC AJ^C GTT GTC AAG AAC GTC ACT ATC GAA CAC TCC 167 9 

Asn Asn Val Val Lys Asn Val Thr lie Glu His Ser 

235 240 245 



ACC GTG AGC AAT TCC GAG AAC GCC GTC CGG ATT AAG 1715 
40 Thr Val Ser Asn Ser Glu Asn Ala Val Arg lie Lys 

250 255 



ACC GTC TCT GGT GCC ACT GGT TCC GTG TCT GAG ATC 1751 

Thr Val Ser Gly Ala Thr Gly Ser Val' Ser Glu lie 
260 265 270 

ACA TAG TCC AAC ATT GTC ATG TCC GGC ATC TCC GAT 17 87 

Thr Tyr Ser Asn lie Val Met Ser Gly lie Ser Asp 

275 280 
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TAG GGC GTC GTT ATC GAG GAG GAT TAG GAG GAT GGC 182 3 

Tyr Gly Val Val He Gin Gin Asp Tyr Glu Asp Gly 
285 290 

AAG CCT ACG GGT AAG CCC ACG AAC GGT GTC ACT ATT 185 9 

Lys Pro Thr Gly Lys Pro Thr Asn Gly Val Thr He 
295 300 305 

ACG GAT GTC AAG CTG GAG AGC GTG ACT GGT ACT GTG 18 95 

Thr Asp Val Lys Leu Glu Ser Val Thr Gly Thr Val 
310 315 

GAT AGT AAG GCT ACT GAT ATC TAT CTC GTT TGC GGA 1931 

Asp Ser Lys Ala Thr Asp lie Tyr Leu Leu Cys Gly 

320 325 330 

TCT GGT AGC TGC TCG GAG TGG ACT TGG GAG GAT GTG 19 67 

Ser Gly Ser Cys Ser Asp Trp Thr Trp Asp Asp Val 

335 340 

AAG GTC ACT GGA GGA AAG AAG TCT ACT GCT TGC AAA 2 0 03 

Lys Vai Thr Gly Gly Lys Lys Ser Thr Ala Cys Lys 
345 350 

AAC TAG CCT TCG GTG GCT TCT TGC TAG GTTAGTAGGT 2 04 0 

Asn Tyr Pro Ser Val Ala Ser Cys 
355 360 362 



^5 TGTTCGGTTG TAGCACTTGC TAACATGCAT TTGCCTTGAG 2 080 

GGGTCAAATG GATTTGTGAA ATTATCGGTG TGAAAGAAGA 2120 

GATGGTGTCC AGTAAGATTC AGTGGTGGCA GCGTGTATAG 2160 

CTCTATACAA TGTTATTTAT CGCATAACTC TATATATCAA 2 200 

ATACTCGATT AAGAGAACCT GCCTTCAGCC ACAAATAACC 2240 

AAGTTCCTCG GCAGGTATCA GATTCCGGGA ACCCCTACCT 2280 

AGCCTTACTC CGTTGCAGAT AGTTCTGCTG GTAAATCAGG 2320 

ATGGTATGCT GAATTACGAT GCCAGCAAAT TCACCGCTAC 2 3 60 
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TCCAACCCGC GCATAAAACA TACGCCAGAA GTGCAAGGGA 2 400 

TAAAGACAGC CAGCTGTGTC TTCGCGCAAG TAACTCTGCA 244 0 

TAATCCAGTT TCTGACATAG TATAGTCATC TCTTCTACAC 24 80 

CTTGGCAAAC TCCCTCTCCA TAAACTCCCT CACAGTAATA 2520 

AGCGTCTCCC TCCCCTTCCC GCCCTTGGCC TTAAACGCCG 2 5 60 

CCTTCTTCAA CGCCGCAGCC GTTTGAATCT CCGTAGGCCC 2 600 

ATTACCAAAG TACTCCCCAT CCCTAAACTG CGAATAAATA 2 64 0 

AAAACATCCC GCTCCTGCTC ACGCCAATAC TCCCCTTTCG 2 68 0 

TATTCATGTC GTTCGGGTCC GCTAGAACCT CATACCGGGC 2 72 0 

TTTCTTGCCA GTCACTATAC CCATCACCGT CAGCATCTAC 27 60 

TCATCACTCA TCACGTAAAA GATAATGATA AGTAAAACCT 2 8 00 

TACCAGAAAC AAACGTATTC ACCACCTCCG CCGATGAAAC 284 0 

AATATCACTC GACCCCTGAA TCAGCCGTCT ATTCCACCGC 2 8 80 

AACGGGTTAA CAAAAACCCC ATGAACCAGA TCCCCAAAGT 2 920 

CATCCTTCAT ACTGATCCAG GGGAAATCTT CTTTCCCGCC 2 960 

CCAAAATGGC GTCTTGTATG TTAGATAACC CTCAGAGTCA 30 00 

GGGAATGTGG GCCATCCGCC GAATAAGTCG GCGTAGTCCG 30 4 0 

GCTCGAG 304 7 



35 



45 



50 



SEQ ID NO. 4 

SEQUENCE TYPE: Nucleotide with corresponding polypeptide 
SEQUENCE LENGTH: 2304 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: genomic 

ORIGINAL SOURCE ORGANISM: Aspergillus niger N400 
IMMEDIATE EXPERIMENTAL SOURCE: pGW1910 (DSM 5943) 

FEATURES: Gene pgaC encoding A. niger N4G0 prepro-polygalacturonaseC 

from 1 to 662: promoter region 

from 663 to 710: coding region for PGC signal peptide 

from 663 to 782: putative coding region for PGC prepro-sequence 

from 783 to 1995: putative coding region for mature PGC 

from 927 to 1001: intron 

from 1428 to 1483: intron. 

from 1614 to 1666: intron 

from 1996 to 1998: stop codon 

from 1999 to 2304: 3 non-coding region, part of transcriptional terminator region 
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CTGACCTAGT 


CCATTTCATT 


4 0 
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CACAACTATG 
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PTTTTPPTPP 




AA i b i A 1 ACj i 


TCGTGGGTGG 
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AALl CCCACjC 


ATAACATGGA 


150 
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GCGACAAACA 


TGGTACCACA 


o A r\ 

200 


10 


A T TP T TP A A T 


AAov^Ai i lAvj 


ACTCCGGCTT 


GGTTTTTTCC 


2 4 0 


VJ 1 ioCril iota 


/*' 'P/^/^T*/*^ T**T« 

i ULj 1 CjTT 


GTTCCTGCGC 


ACGGTGCAAC 


280 




PPTT A Tr'Tr'r' 


AOUAAi bAT i 


ACCTGCAGAA 


AGCCCATGGC 


320 






u ACjUCCj AC T T 


GAGTCCACCT 


AACAGCATAC 


360 


15 


TCAGCATACT 


CGATGCTGGG 


AATTACTTCA 


GGTATGGTCG 


400 




GCTGAATATG 


CAGTCGATTT 


CCGQCQCCKQ 


GAACAGTCCG 


440 




GCCCCCTCAC 


CACGGACATA 


GACCCGCCAC 


CAAAACGTCG 


480 


20 


ATAGCGGCCT 


CGCTTCGTCA 


ACGACCATTT 


TGCCCACCCC 


520 
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TGACACCCAG GAAAGACGTC TTCCAATAA.C AATATAAAAG 5 60 

GGCGAGTCTT GTCCTCTCAC TTTCCGTCTT TCGAGTATGC 60 0 

CTTGCCAGTT CCCAACTTGA CTTGAGACCT CCCATTTCGG 54 0 

TCATTTGTCA CCCGCTATAA GA ATG GTC CGT GAG CTT 67 7 

Met Val Arg Gin Leu 
1 5 

ATG CTG ATC AGC AGT CTG CTG GCA GCT GTT GOT GTG 713 
lie Leu lie Ser Ser Leu Leu Ala Ala Val Ala Val 

10 15 

CGC GCG OCT GCC GAT CCG GCT CAT CCC ATG GTT ACG 74 9 

Arg Ala Pro A.ia Asp Pro Ala His Pro Met Val Thr 
20 25 

25 GAA GCG CCT GAC GTC AAT TTG GTT GAA AAG AGG GCG 785 

Glu Ala Pro Asp Val Asn Leii Val Glu Lys Arg Ala 
30 35 40 



75 



20 



30 



35 
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50 



ACT ACT TGC ACC TTC TCG GGC TCC GAA GGT GCA TCC 821 

Thr Thr Cys Thr Phe Ser Gly Ser Glu Gly Ala Ser 
45 50 

AAG GCC AGC AAG TCG AAJV ACC TCT TGC TCC ACA ATC 8 57 

Lys Ala Ser Lys Ser Lys Thr Ser Cys Ser Thr lie 
55 60 65 

TAG CTG TCC GAC GTG GCC GTC CCA TCT GGC ACA ACC 8 93 

Tyr Leu Ser Asp Val Ala Val Pro Ser Gly Thr Thr 

70 75 

CTT GAT CTC TCT GAC CTG AAT GAT GGA ACC CAC 92 6 

Leu Asp Leu Ser Asp Leu Asn Asp Gly Thr His 
80 85 
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10 



15 



20 



30 



35 



40 



45 



GTACGTTCCC CGGGTGATTC ATGTCTTATT CTCACACCCA 966 



ATCGCGTCAA TAGTACTGGC TGACAGATGT ACTAG'GTG ATC TTC 1010 

Val He Phe 
90 

GAG GGA GAA AGO ACT TTT GGA TAG GAG GAA TGG GAA 104 6 

Gin Gly Glu Thr Thr Phe Gly Tyr Glu Glu Trp Glu 
95 100 

GGG CCT CTT GTG CGT GTT TCT GGA ACT GAT ATC ACG 1082 
Gly Pro Leu Val Arg Val Ser Gly Thr Asp He Thr 

110 115 



GTC GAG GGG GAG AGC GAG GCG GTG CTC AAT GGG GAT 1118 
Val Glu Gly Glu Ser Asp Ala Val Leu Asn Gly Asp 

120 125 

GGC AGC GGG TGG TGG GAT GGA GAG GGT GGG AAT GGT 1154 
Gly Ser Arg Trp Trp Asp Gly Glu Gly Gly Asn Gly 
130 135 



GGT A-AA ACA AAG GCG AAG TTC TTC TAT GCG CAT GAG 1190 
Gly Lys Thr Lys Pro Lys Phe Phe Tyr Ala His Asp 
1^0 145 150 

TTG AGC TCT TCC AGC ATC AAG AGC ATC TAG ATC GAG 122 6 

Leu Thr Ser Ser Thr lie Lys Ser He" Tyr lie Glu 
155 160 

AAC TCT CCT GTG GAG GTG TTC AGC ATC GAT GGC TCC 12 62 

Asn Ser Pro Val ' Gin Vai Phe Ser He Asp Gly Ser 
165 170 175 
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ACT GAT CTT ACQ ATG ACT GAT ATC ACG GTG GAT AAC 12 98 
Thr Asp Leu Thr Met Thr Asp lie Thr Val Asp Asn 
180 185 

ACG GAT GGT GAC ACG GAC GAC CTC GCC GCC AAT ACG 133 4 
Thr Asp Giy Asp Thr Asp Asp Leu Ala Ala Asn Thr 
190 , 195 

GAT GGC TTC GAC ATC GGG GAA AGC ACC TAT ATC ACG 137 0 
Asp Gly Phe Asp lie Gly Glu Ser Thr Tyr He Thr 
200 205 210 

ATC ACA GGT GCC GAA ATC TAC AAC CAA GAT GAC TGC 140 6 
He Thr Gly Ala Glu He Tyr Asn Gin Asp Asp Cys 
215 220 

GTT GCC ATC AAT TCT GGA GAG GTATGCGTCC CCTGGCACTG 1447 
Val Ala He Asn Ser Gly Glu 

225 230 - 

ATTGAGAGGA GAGGCGCCTT GCTGACGAT CTGGTAG AAC 148 6 

Asn 



ATT TAT TTC AGT GCC AGT GTG TGT TCT GGT GGT CAC 152 2 

He Tyr Phe Ser Ala Ser Val Cys Ser Gly Gly His 

235 240 

GGC CTG TCT ATT GGC TCT GTT GGT GGT CGG GAT GAT 1558 

Gly Leu Ser He Giy Ser Val Gly Gly Arg Asp Asp 

245 250 255 

AAC ACT GTC AAG AAC GTG ACG TTT TAT GAT GTC AAT 15 94 

Asn Thr Val Lys Asn Val Thr Phe Tyr Asp Vai Asn 

260 265 



81 



EP 0 421 919 A2 



GTT CTC AAG TCC GAG CAA GGTAAGGGAA AGGATTTGAA 1632 
Val Leu Lys Ser Gin Gin 
270 

CCATCCGTTT GCCGTCCTCT AACGTATGCT TTA GCA ATC CGT 167 4 

Ala lie Arg 
275 

ATC AAG AGO ATC TAG GGC GAG ACT GGG TCC GTC AGC 1710 
He Lys Thr He Tyr Gly Asp Thr Gly Ser Val Ser 
280 285 

GAA GTC ACT TAG CAT GAG ATT GCC TTT TCG GAC OCT 17 4 6 
Giu Val Thr Tyr His Glu lie Ala Phe Ser Asp Ala 
290 295 300 

ACT GAT TAG GGC ATT GTC ATT GAG GAG AAC TAT GAT 178 2 
Thr Asp Tyr Gly He Vai lie Giu Gin Asn Tyr Asp 
305 310 

GAC ACC TCC AAG ACC CCT ACT ACC GGG GTA CCG ATC 1818 
Asp Thr Ser Lys Thr Pro Thr Thr Gly Vai Pro lie ■ 
315 320 

ACG GAT TTT GTG CTC GAG AAC ATC GTT GGA ACG TGT 18 54 
Thr Asp Phe Vai Leu Glu Asn He Val Gly Thr Cys 
325 330 335 

GAA GAT GAT GAT TGT ACC GAA GTA TAG ATT GCC TGC 18 90 
Giu Asp Asp Asp Cys Thr Giu Val Tyr He Ala Cys 
340 345 

GGT GAT GGC AGC TGC TCG GAT TGG ACT TGG ACT GGC 192 6 
Gly Asp Gly Ser Cys Ser Asp Trp Thr Trp Thr Gly 
350 355 360 
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20 



GTG AGC GTG ACT GGC GGC AGT GTC AGT GAG GAG TGC IS 62 

Vai Ser Val Thr Gly Giy Ser Val Ser Asp Asp Cys 

365 370 

CTT AAT GTT CCC TCC GGG ATT AGC TGC GAT TTG TAG 1998 

Leu "Asn Val Pro Ser Gly lie Ser Cys Asp Leu 

375 380 383 



GCCGTTGTGA TTGGGGGAGA TGTTCCCGGG TACTCTGGTC 20 3 8 

75 GTTCGTTTAG CCAGCCCTTT CTGTTGAGTG GCCTGTTCTT 207 8 

CTATACTGTC GATTGTGTGT GAGATTACTA CCTTGCCCGA 2118 

GGAAAGAGCA GTGCATCCTA TCTTTTATTC TTTGCCGGTC 2158 

GGGGTTGGCC TACTAGTACA TTGTACCCGA AGAGTCAACG 2198 

GTGAAAGTAA AGGGCTTACA GAATTAGTCC AGGAGCAAAG 223 8 

ACAACTTTTA TCAGAGCGCA TCGGTCACGT CATGTTGAAA 2 27 8 

GTATTGATCA ATGAAGAT.AA CCTTGG 230 4 

25 



Claims 

30 

1. A recombinant DNA molecule comprising a DNA sequence selected from 

a) the Aspergillus niger DNA insert of pGW1800 or pGW1900. 

b) a DNA sequence which hybridizes to the coding region for the mature PGM or PGI comprised by a 
DNA insert of a) and which comprises a structural gene for a polypeptide with gaiacturonase or 

55 polygalacturonase activity, and optionally a promoter, a coding region for a signal or leader peptide 
and/or a transcriptional terminator, 

c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and 
which is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or 

^0 c). 

2. A recombinant DNA molecule according to claim 1 comprising the Aspergillus niger DNA insert of 
pG W1 800 or pG W1 900. 

3. A recombinant DNA molecule according to claim 1 comprising a DNA sequence which hybridizes to the 
coding region for the mature PGil or PGI comprised by a DNA insert of pGWIBOO or pGW1900 and which 

45 codes for a polypeptide with gaiacturonase or polygalacturonase activity, and/or a signal or leader peptide 
and/or has promoter and/or transcriptional terminator activity. 

4. A recombinant DNA molecule according to claim 1 comprising a derivative of the Aspergillus niger DNA 
insert of pGW1800 or pGW1900, or of a DNA sequence which hybridizes to the coding region for the 
mature PGI! or PGI comprised by a DNA insert of pGW180D or pGW1900 and which codes for a 

50 polypeptide with gaiacturonase or polygalacturonase activity, and/or a signal or leader peptide and/or has 
promoter and/or transcriptional terminator activity. 

5. A recombinant DNA molecule according to claim 4 comprising an insert or fragment thereof of a vector 
selected from the group of vectors consisting of XPG-A9, -A10. -A43, -B4, -B35, -B36, -C1, -Cl6, -G20, 
-C37, -D8. -D11. -D12, -E6, -E38. -F5, -G17, -G27. -X31 and -Y33. pGW1755. pGWlQlO. pGW1800, 

55 - PGW1900 and pGW1756. 

6. A recombinant DNA molecule according to claim 4 comprising a DNA fragment derived from an A. niger 
insert of pGWISOO or pGWigoo, or a DNA sequence which hybridizes to the coding region for the~mature 
PGII or PGI comprised by one of said DNA inserts and encodes a structural gene and/or a leader or signal 
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peptide and/or comprises a sequence with promoter and/or transcriptional terminator activity. 

7. A recombinant DNA molecule according to claim 6 comprising a DNA fragment with promoter activity 
derived from an insert of a vector selected from the group of vectors consisting of XPG-A9, -A10, -A43, -B4, 
-B35. -B36. -C1, -C16, -C20. -037. -D8. -D11. -Dl2, -E6, -E38. -F5. -G17, -G27. -X31 and -Y33.'pGWl91 o' 

5 pGW1800. pGW1900 and pGW1756, 

8. A recombinant DNA molecule according to claim 6 comprising a DNA fragment encoding a leader 
peptide derived from an insert of a vector selected from the group of vectors consisting of XPG-A9. -A10 
'A43, -64. -B35. -B36, -01. -C16, -020. -037, -D8. -011. -012. -E6. -E38. -F5, -G17. -G27, -X31 and -Y33' 
pG W1 910, pGWI 800. pG W1 900 and pGWl 756. 

10 9- A recombinant DNA molecule according to claim 6 comprising a DNA fragment encoding a signal 
peptide derived from an insert of a vector selected from the group of vectors consisting of XPG-A9, -A10, 
-A43. -B4. .835, -B36, -01, -016, -020. -037. -D8, -011. -D12. -E6. -£38. -F5, -G17. -G27. -X31 and -Y33' 
pG W1 91 0, pG W1 800, pG W1 900 and pGW1 756. 

10. A recombinant DNA molecule according to claim 6 comprising a DNA fragment with transcriptional 
15 terminator activity derived from an insert of a vector selected from the group of vectors consisting of XPG- 

A9. -A10, -A43, -B4, -B35. -B36, -01. -016. -020. -037. -D8. -DII, -012, -E6. -E38. -F5, -G17, -027. -X31 
and -Y33. pGWl910, pGWIBOO, pGW1900 and pGW1756. 

11. A recombinant DNA molecule according to claim 6 comprising a DNA fragment encoding a structural 
gene for a. galacturonase or polygalacturonase or a fragment thereof with gaiacturonase or polygalac- 

20 turonase activity derived from an insert of a vector selected from the group of vectors consisting of XPG-A9 
-A10. -A43. -B4. -B35, -B36, -01. -016, -020. -037. -08. -Oil, -D12. -E6. -E38, -F5, -G17, .G27. -X31 and 
-Y33, pGW1910. pGWiBOO, pGWIGOO and pGW1756. 

12. A recombinant DNA molecule according to claim 1 which is an expression cassette. 

13. A recombinant DNA molecule according to claim 1 which is a hybrid vector. 

25 14. A recombinant DNA molecule according to claim 13 selected from the group of vectors consisting of 
pGW 1800. pGW 1900, pGW 1756. pGW 1910. XPG-A9, XPG-AIO. XPG-A43. XPG-B4, XPG-B35. XPG-B36 
XPG-C1. XPG-016. XPG-020. \PG-037, XPG-D8. XPG-D11, \PG-D12, \PG-E6, XPG-E38, XPG-F5' XPG-G17 
XPG-G27. XPG-X31 and XPG-Y33. 

15. A DNA molecule comprising a gene for a PG or a functional fragment thereof which is isolated from a 
30 PG producing filamentous fungal strain. 

16. A DNA molecule according to claim 15 which is isolated from Aspergillus spec . 

17. A DNA molecule according to claim 15 which is an insert of a hybrid vecToFleiected from the group of 
hybrid vectors consisting of pGW1800. pGW 19Q0, pGW 1756. pGW 1910. XPG-A9, XPG-A10. XPG-A43, 
XPG-B4. XPG-B35. XPG-B36. XPG-Cl. XPG-G16, XPG-020, XPG-C37. XPG-D8. XPG-D11, XPG-D12 XPG-Eo' 

35 XPG-E38. XPG-F5. XPG-G17. XPG-G27. XPG-X31 and XPG-Y33 or a fragment thereof comprising a 
promoter region of a PG gene, or a coding region for a signal peptide, leader peptide, prepro-Po! PG or a 
fragment of a PG with galacturonase or polygalacturonase activity, or a transcriptional terminator region of a 
PG gene. 

18. A process for the preparation of a recombinant DNA molecule according to claim 1 or a DNA molecule 
40 according to claim 15 comprising 

a) isolating genomic DNA from suitable fungal cells, and selecting the desired DNA, e.g. using a DNA 
probe or using a suitable expression system and screening for expression of the desired polypeptide, or 

b) isolating mRNA from suitable fungal cells, selecting the desired mRNA. e.g. by hybridization with a 
DNA probe or by expression in a suitable expression system and screening for expression of the desired 

45 polypeptide, preparing single-stranded cDNA complementary to that mRNA, then double-stranded cDNA 
therefrom, or 

c) isolating cDNA from a cDNA library and selecting the desired cDNA. e.g. using a DNA probe or using 
a suitable expression system and screening for expression of the desired polypeptide, or/and 

d) incorporating the double-stranded DNA of step a), b) or c) into an appropriate vector, 
50 e) transforming appropriate host cells with the obtained hybrid vector, 

f) selecting transformed host cells which contain the desired DNA from untransformed host cells or 
multiplicating the transformed host cell, and, when required, 

g) isolating the desired DNA and/or converting the DNA into a mutant or fragment thereof. 
^ 19. A host transformed with a recombinant DNA molecule according to claim 1. 

55 20. A process for the preparation of a transformed host according to claim 19 comprising treatment of a 
suitable host cell under transforming conditions with a recombinant DNA molecule of the present invention, 
especially a hybrid vector of the invention, optionally together with a selection marker gene and optionally 
selecting the transformants. 
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21. A process for the preparation of a polypeptide characterized In that a structural gene inserted in an 
expression cassette according to claim 12 is expressed in a suitable transformed host according to 
conventional methods and optionally the polypeptide is isolated in conventional manner. 

22. A process according to claim 21 wherein an Aspergillus niger strain is used as host, 

5 23. A process according to claim 21 wherein an Aspergillus nidulans strain is used as host. 

24. A process according to claim 21 wherein the structural gene encpdes the hybrid interferon BDBB. 

25. A process according to claim 21 wherein the structural gene encodes a polypeptide with galacturonase 
or polygalacturonase activity and is derived from a DNA sequence according to claim 1. 

26. A process for the preparation of a single PG wherein such PG is purified from a crude source according 
70 to conventional methods. 

27. A process according to claim 26, wherein a PG is produced in purified form from a crude enzyme 
mixture obtained from Aspergillus niger . 

28. A single galacturonase or polygalacturonase encoded by a DNA sequence according to claim 1 or a 
derivative thereof with galacturonase or polygalacturonase activity and biologically acceptable salts thereof. 

75 29. An enzymatic composition comprising a polypeptide according to claim 28 or any mixture thereof 
optionally in a predetermined combination with one or more enzymes having other than PG activity. 
30. The use of a polypeptide according to claim 28 or of an enzymatic composition according to claim 29 in 
plant material processing, 

20 Claims for the following contracting states ES, GR: 

1, A process for the preparation of a recombinant DNA molecule comprising a DNA sequence selected from 

a) the Aspergillus niger DNA insert of pGW1800 or pGW1900, 

b) a DNA sequence which hybridizes to the coding region for the mature PGM or PGI comprised by a DNA 
25 insert of a) and which comprises a structural gene for a polypeptide with galacturonase or polygalac- 
turonase activity, and optionally a promoter, a coding region for a signal or leader peptide and/or a 
transcriptional terminator. 

c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and which 
30 is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or c), 

which process comprises 

a) isolating genomic DNA from suitable fungal cells, and selecting the desired DNA, e.g. using a DNA 
probe or using a suitable expression system and screening for expression of the desired polypeptide, or 

b) isolating mRNA from suitable fungal cells, selecting the desired mRNA, e.g. by hybridization with a 
35 DNA probe or by expression in a suitable expression system and screening for expression of the desired 

polypeptide, preparing single-stranded cDNA complementary to that mRNA, then double-stranded cDNA 
therefrom, or 

c) isolating cDNA from a cDNA library and selecting the desired cDNA. e.g. using a DNA probe or using 
a suitable expression system and screening for expression of the desired polypeptide, or/and 

40 d) incorporating the double-stranded DNA of step a), b) or c) into an appropriate vector, 

e) transforming appropriate host cells with the obtained hybrid vector, 

f) selecting transformed host cells which contain the desired DNA from untransformed host cells or 
multiplicating the transformed host cell, and, when required, 

g) isolating the desired DNA and/or converting the DNA into a mutant or fragment thereof. 

45 2. A process according to claim 1 for the preparation of a recombinant DNA molecule comprising the 
Aspergillus niger DNA insert of pGWISOO or pGW1900. 

3. A process according to claim 1 for the preparation of a recombinant DNA molecule comprising a DNA 
sequence which hybridizes to the coding region for the mature PGII or PGI comprised by a DNA insert of 
pGWISOO or pGW1900 and which codes for a polypeptide with galacturonase or polygalacturonase activity, 

50 and/or signal or leader peptide and/or has promoter and/or transcriptional terminator activity. 

4. A process according to claim 1 for the preparation of a recombinant DNA molecule comprising a 
derivative of the Aspergillus niger DNA insert of pGWISOO or pGW1900. or of a DNA sequence which 
hybridizes to the coding region for the mature PGII or PGI comprised by a DNA insert of pGWISOO or 
pGW1900 and which codes for a polypeptide with galacturonase or polygalacturonase activity, and/or signal 

55 or leader peptide and/or has promoter and/or transcriptional terminator activity. 

5. A process according to claim 4 for the preparation of a recombinant DNA molecule comprising an insert 
or fragment thereof of a vector selected from the group of vectors consisting of XPG-A9. -A10, -A43, -B4. 
-B35, -B36, -C1, -CI 6, -C20, -C37, -D8, -D11, -D12, -E6, -E38, -F5, -G17. -G27, -X31 and -Y33, pGW1756. 
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pGW1910, pGW1800, pGW1900 and pGW1756. 

6. A process according to claim 4 for the preparation of a recombinant DNA molecule comprising a DNA 
fragment derived from the A. niger insert of pGWIBOO or pGW1900, or a DNA sequence which hybridizes 
to the coding region for the mature PGII or PGI comprised by one of said DNA inserts and encodes a 

5 Structural gene and/or a leader or signal peptide and/or comprises a sequence with promoter and/or 
transcriptional terminator activity. 

7. A process according to claim 4 for the preparation of a recombinant DNA molecule comprising a DNA 
fragment with promoter activity derived from an insert of a vector selected from the group of vectors 
consisting of XPG-A9, -A10. -A43. -B4, -B35. -B36, -C1. -C16, -G20. -037. -DB, -D11, -D12, -E6, -E38, -F5, 

70 -G17. -G27. -X31 and -Y33, pGW1910. pGWIBOO, pGW1900 and pGW1756. 

8. A process according to claim 4 for the preparation of a recombinant DNA molecule comprising a DNA 
fragment encoding a leader peptide derived from an insert of a vector selected from the group of vectors 
consisting of XPG-A9. -A10, -A43. -B4, -835, -836, -C1. -C16, -C20, -C37, -DB, -D11, -D12, -E6, -E38, -F5, 
-G17. -G27, -X31 and -Y33, pGW1910. pGWIBOO, pGW19aO and pGW1756. 

15 9. A process according to claim 4 for the preparation of a recombinant DNA molecule comprising a DNA 
fragment encoding a signal peptide derived from an insert of a vector selected from the group of vectors 
consisting of \PG-A9, -A10. -A43, -B4, -835, -836, -01. -C16. -C20. -037. -DB. -D11. -D12, -E6, -E3B, -F5, 
-G17. -G27. -X31 and -Y33. pGW1910. pGWIBOO, pGW1900 and pGW1756. 

10. A process according to claim 4 for the preparation of a recombinant DNA molecule comprising a DNA 
20 fragment with transcriptional terminator activity derived from an insert of a vector selected from the group of 

vectors consisting of XPG-A9, -A10, -A43, -84, -835, -836, -01, -016. -020. -037, -DB, -D11. -D12, -E6. 
-E38, -F5, -G17, -G27, -X31 and -Y33, pGW1910. pGW1800. pGW1900 and pGW1756, 

11. A process according to claim 4 for the preparation of a recombinant DNA molecule comprising a DNA 
fragment encoding a structural gene for a galacturonase or polygalacturonase or a fragment thereof with 

25 galacturonase or polygalacturonase activity derived from an insert of a vector selected from the group of 
vectors consisting of XPG-A9. -A10, -A43, -84, -835, -836. -01, -016, -020, -037. -DB, -D11. -D12, -E6. 
-E3B. -F5, -G17, -G27. -X31 and -Y33, pGWl910, pGWIBOO. pGW1900 and pGW1756. 

12. A process according to claim 1 for the preparation of a recombinant DNA molecule according to claim 1 
which is an expression cassette. 

30 13- A process according to claim 1 for the preparation of a recombinant DNA molecule which is a hybrid 
vector. 

14. A process according to claim 1 for the preparation of a recombinant DNA mo)ecu!e selected from the 
group of vectors consisting of pGWIBOO. pGW 1900, pGW 1756, pGW 1910, XPG-A9. XPG-A10, XPG-A43, 
\PG-84. XPG-B35. XPG-B36, XPG-01, XPG-016, XPG-O20, XPG-037. XPG-D8. XPG-D11, XPG-D12. XPG-Ee! 

35 XPG-E3B, XPG-F5, XPG-G17. XPG-G27. XPG-X31 and XPG-Y33. 

15. A process for the preparation of a DNA molecule comprising a gene for a PG or a functional fragment 
thereof which is isolated from a PG producing filamentous fungal strain which process comprises 

a) isolating genomic DNA from suitable fungal cells, and selecting the desired DNA, e.g. using a DNA 
probe or using a suitable expression system and screening for expression of the desired polypeptide, or 
40 b) isolating mRNA from suitable fungal cells, selecting the desired mRNA, e.g. by hybridization with a 
DNA probe or by expression in a suitable expression system and screening for expression of the desired 
polypeptide, preparing single-stranded cDNA complementary to that mRNA, then double-stranded cDNA 
therefrom, or 

c) isolating cDNA from a cDNA library and selecting the desired cDNA. e.g. using a DNA probe or using 
45 a suitable expression system and screening for expression of the desired polypeptide, or/and 

d) incorporating the double-stranded DNA of step a), b) or c) into an appropriate vector, 

e) transforming appropriate host cells with the obtained hybrid vector, 

f) selecting transformed host cells which contain the desired DNA from untransformed host cells or 
multipiicating the transformed host ceil, and, when required, g) isolating the desired DNA and/or 

50 converting the DNA into a mutant or fragment thereof. 

16. A process according to claim 15 for the preparation of a DNA molecule which is isolated from 
Aspergillus spec . 

17. A process according to claim 15 for the preparation of a DNA molecule which is an insert of a hybrid 
vector selected from the group of hybrid vectors consisting of pGW1800, pGW 1900, pGW 1756, pGW 

55 1910, XPG-A9. XPG-A10. XPG-A43, XPG-B4. XPG-B35, XPG-836. XPG-01, XPG-016, XPG-020, XPG-037. 
XPG-D8i XPG-D11, XPG-D12, XPG-E6, XPG-E38, XPG-F5. XPG-G17. XPG-G27. XPG-X31 and XPG-Y33 or a 
fragment thereof comprising a promoter region of a PG gene, or a coding region for a signal peptide, leader 
peptide, prepro-PG. PG or a fragment of a PG with galacturonase or polygalacturonase activity, or a 
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transcriptional terminator region of a PG gene. 

18. A process for the preparation of a host transformed with a recombinant DNA molecule comprising a 
DNA sequence selected from 

a) the Aspergillus niger DNA insert of pGW1800 or pGW1900, 
5 b) a DNA sequence which hybridizes to the coding region for the mature PGII or PGI comprised by a DNA 
insert of a) and which comprises a structural gene for a polypeptide with galacturonase or polygalac- 
turonase activity, and optionally a promoter, a coding region for a signal or leader peptide and/or a 
transcriptional terminator, 

c) a derivative of a DNA sequence defined in a) or b), or 

10 d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and which 
is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or c), 
comprising treatment of a suitable host cell under transforming conditions with a recombinant DNA 
molecule of the present invention, especially a hybrid vector of the invention, optionally together with a 
selection marker gene and optionally selecting the transformants. 

15 19. A process for the preparation of a polypeptide characterized in that a structural gene inserted in an 
. expression cassette comprising a DNA sequence selected from 

a) the Aspergillus niger DNA insert of pGWISOO or pGW1900, 

b) a DNA sequence which hybridizes to the coding region for the mature PGII or PGI comprised by a DNA 
insert of a) and which comprises a structural gene for a polypeptide with galacturonase or poiygalac- 

20 turonase activity, and optionally a promoter, a coding region for a signal or leader peptide and/or a 
transcriptional terminator. 

c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and which 
is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or c), 

25 is expressed in a suitable host according to conventional methods and optionally the polypeptide is isolated 
in conventional manner. 

20. A process according to claim 19 wherein an Aspergillus niger strain is used. 

21. A process according to claim 19 wherein an Aspergillus nidulans strain is used. 

22. A process according to claim 19 wherein the structural gene encodes the hybrid interferon BDBB. 

30 23. A process according to claim 19 wherein the structural gene encodes a polypeptide with galacturonase 
or polygalacturonase activity and is derived from a DNA sequence selected from 

a) the Aspergillus niger DNA insert of pGWISOO or pGW1900, 

b) a DNA sequence which hybridizes to the coding region for the mature PGII or PGI comprised by a 
DNA insert of a) and which comprises a structural gene for a polypeptide with galacturonase or 

35 polygalacturonase activity, and optionally a promoter, a coding region for a signal or leader peptide 
and/or a transcriptional terminator, 

c) a derivative of a DNA sequence defined in a) or b), or 

d) a DNA sequence which codes for a mature PG or a signal peptide or a leader peptide thereof and 
which is degenerate within the meaning of the genetic code with respect to a DNA sequence of a), b) or 

40 C). 

24. A process for the preparation of a single PG wherein such PG is purified from a crude source according 
to conventional methods. 

25. A process according to claim 24, wherein a PG is produced in purified form from a crude enzyme 
mixture obtained from Aspergillus niger . 

45 26. A process for the preparation of a single galacturonase or polygalacturonase encoded by a DNA 
sequence according to clainn 1 or a derivative thereof with galacturonase or polygalacturonase activity and 
biologically acceptable salts thereof. 

27. A process for the preparation of an enzymatic composition comprising a polypeptide according to claim 
26 or any mixture thereof optionally in a predetermined combination with one or more enzymes having 

50 other than PG activity. 

28. A process for plant material processing in which a polypeptide according, to claim 26 or an enzymatic 
composition according to claim 27 is used. 
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FIGURE 1 
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FIGURE 3 



Jl SaJl. 250 



7S00. /<pnl 
7400, 




pGwigoo 

11300 bp 



62B0. £cd^X ^ 

5780. SphT 




■NCOl, 2730 



Sp/7l, 3350 
ACs-il. 2570 



//i/7dIII, 3SS0 



. 5710. 5"/;7<3l /Vi/7dIII. 5550 



4400 
Kpnl, 4700 
Sc&\y. 47 EO 

/r/9/7l^ 5050 



90 

BNSOOCID: <EP 0421919A2> 



EP 0 421 919 A2 



FIGURE 4 
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PCR with CLISOs A and B : 



FIGURE 5 
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FIGURE 5 continued 
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FIGURE 6 
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FIGURE 7 
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